GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM
AND METHODS OF USE THEREOF

BACKGROUND OF THE INVENTION
Field of the Invention

The present invention describes nucleic acid sequences for eukaryotic genes encoding
lycopene ε-cyclase (also known as ε-cyclase and ε lycopene cyclase), isopentenyl
pyrophosphate (IPP) isomerase and β-carotene hydroxylase, as well as vectors containing the
same and hosts transformed with said vectors. The present invention also provides methods
for augmenting the accumulation of carotenoids, changing the composition of the
carotenoids, and producing novel and rare carotenoids. The present invention provides
methods for controlling the ratio or relative amounts of various carotenoids in a host. The
invention also relates to modified lycopene ε-cyclase, IPP isomerase and β-carotene
hydroxylase. Additionally, the present invention provides a method for screening for genes
and cDNAs encoding enzymes of carotenoid biosynthesis and metabolism.

Background of the Invention

Carotenoid pigments with cyclic endgroups are essential components of the
photosynthetic apparatus in oxygenic photosynthetic organisms (e.g., cyanobacteria, algae
and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment
β-carotene (or, in rare cases, the asymmetrical bicyclic α-carotene) is intimately associated with
the photosynthetic reaction centers and plays a vital role in protecting against potentially
lethal photooxidative damage (Koyama, 1991). β-Carotene and other carotenoids derived
from it or from α-carotene also serve as light-harvesting pigments (Siefermann-Harms,
1987), are involved in the thermal dissipation of excess light energy captured by the
light-harvesting antenna (Demmig-Adams & Adams, 1992), provide substrate for the biosynthesis
of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991; Parry & Horgan, 1991),
and are precursors of vitamin A in human and animal diets (Krinsky, 1987). Plants also
exploit carotenoids as coloring agents in flowers and fruits to attract pollinators and agents of
seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of agronomic
value in a number of important crops. Carotenoids are currently harvested from a variety of
organisms, including plants, algae, yeasts, cyanobacteria and bacteria, for use as pigments in
food and feed.

The probable pathway for formation of cyclic carotenoids in plants, algae and
cyanobacteria is illustrated in Figure 1. Two types of cyclic endgroups or rings are
commonly found in higher plant carotenoids; these are referred to as the β (beta) and ε
(epsilon) rings (Fig. 3). The precursor acyclic endgroup (no ring structure) is referred to as
the ψ (psi) endgroup. The β and ε endgroups differ only in the position of the double bond in
the ring. Carotenoids with two β rings are ubiquitous, and those with one β and one ε ring
are common, but carotenoids with two ε rings are uncommon. β-Carotene (Fig. 1) has two
β-endgroups and is a symmetrical compound that is the precursor of a number of other
important plant carotenoids such as zeaxanthin and violaxanthin (Fig. 2).

Genes encoding enzymes of carotenoid biosynthesis have previously been isolated 
from a variety of sources including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216,
254-268; Misawa et al., 1990, J. Bacteriol. 172, 6704-12), fungi (Schmidhauser et al., 1990,
Mol. Cell. Biol. 10, 5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch. 45c,
482-86; Cunningham et al., 1994) and higher plants (Bartley et al., Proc. Natl. Acad. Sci. USA
88, 6532-36; Martinez-Ferez & Vioque, 1992, Plant Mol. Biol. 18, 981-83). Many of the
isolated enzymes show a great diversity in structure, function and inhibitory properties
between sources. For example, phytoene desaturases from the cyanobacterium
Synechococcus and from higher plants and green algae carry out a two-step desaturation to
yield ζ-carotene as a reaction product. In plants and cyanobacteria a second enzyme
(ζ-carotene desaturase), similar in amino acid sequence to the phytoene desaturase, catalyzes
two additional desaturations to yield lycopene. In contrast, a single desaturase enzyme from 
Erwinia herbicola and from other bacteria introduces all four double bonds required to form 
lycopene. The Erwinia and other bacterial desaturases bear little amino acid sequence 
similarity to the plant and cyanobacterial desaturase enzymes, and are thought to be of 
unrelated ancestry. Therefore, even with a gene in hand from one source, it may be difficult to 
identify a gene encoding an enzyme of similar function in another organism. In particular, 
the sequence similarity between certain of the prokaryotic and eukaryotic genes encoding 
enzymes of carotenoid biosynthesis is quite low. 

Further, the mechanism of gene expression in prokaryotes and eukaryotes appears to 
differ sufficiently such that one cannot expect that an isolated eukaryotic gene will be 
properly expressed in a prokaryotic host.

The difficulty in isolating genes encoding enzymes with similar functions is
exemplified by recent efforts to isolate the gene encoding the enzyme that catalyzes the
formation of β-carotene from the acyclic precursor lycopene. Although a gene encoding an
enzyme with this function had been isolated from a bacterium, it had not been isolated from
any photosynthetic prokaryote or from any eukaryotic organism. The isolation and
characterization of the enzyme catalyzing formation of β-carotene in the cyanobacterium
Synechococcus PCC7942 was described by the present inventors and others (Cunningham et
al., 1993 and 1994). The amino acid sequence similarity of the cyanobacterial enzyme to the
various bacterial lycopene β-cyclases is so low (ca. 18-25% overall; Cunningham et al.,
1994) that there is much uncertainty as to whether they share a common ancestry or, instead,
represent an example of convergent evolution.

The need remains for the isolation of eukaryotic and prokaryotic genes and cDNAs
encoding polypeptides involved in the carotenoid biosynthetic pathway, including those
encoding a lycopene ε-cyclase, IPP isomerase and β-carotene hydroxylase. There remains a
need for methods to enhance the production of carotenoids, to alter the composition of
carotenoids, and to reduce or eliminate carotenoid production. There also remains a need in
the art for methods for screening for genes and cDNAs encoding enzymes of carotenoid
biosynthesis and metabolism.

SUMMARY OF THE INVENTION
Accordingly, a first object of this invention is to provide purified and/or isolated
nucleic acids which encode enzymes involved in carotenoid biosynthesis; in particular,
lycopene ε-cyclase, IPP isomerase and β-carotene hydroxylase.

A second object of this invention is to provide purified and/or isolated nucleic acids
which encode enzymes which produce novel or uncommon carotenoids.

A third object of the present invention is to provide vectors containing said genes.

A fourth object of the present invention is to provide hosts transformed with said
vectors.

Another object of the present invention is to provide hosts which accumulate novel or
uncommon carotenoids or which accumulate greater amounts of specific or total carotenoids.

Another object of the present invention is to provide hosts with inhibited and/or
altered carotenoid production.

Another object of this invention is to secure the expression of eukaryotic
carotenoid-related genes in a recombinant prokaryotic host.

Yet another object of the present invention is to provide a method for screening for
eukaryotic and prokaryotic genes and cDNAs which encode enzymes involved in carotenoid
biosynthesis and metabolism.

An additional object of the invention is to provide a method for manipulating
carotenoid biosynthesis in photosynthetic organisms by inhibiting the synthesis of certain
enzymatic products to cause accumulation of precursor compounds.

Another object of the invention is to provide modified lycopene ε-cyclase, IPP
isomerase and β-carotene hydroxylase.

These and other objects of the present invention have been realized by the present
inventors as described below.

A subject of the present invention is an isolated and/or purified nucleic acid sequence 
which encodes for a protein having lycopene ε-cyclase, IPP isomerase or β-carotene
hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4,
14-21 or 23-27.

The invention also includes vectors which comprise any of the nucleic acid sequences 
listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of producing or enhancing the 
production of a carotenoid in a host cell, comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which encodes for a protein having
lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase enzyme activity, wherein the
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence to produce the protein. 

Yet another subject of the present invention is a method of modifying the production
of carotenoids in a host cell, the method comprising inserting into the host cell a vector
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for
a protein which modifies lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid
sequence in the host cell to modify the production of the carotenoids in the host cell, relative
to the untransformed host cell.

The present invention also includes a method of expressing, in a host cell, a
heterologous nucleic acid sequence which encodes for a protein having lycopene ε-cyclase,
IPP isomerase or β-carotene hydroxylase enzyme activity, the method comprising inserting
into the host cell a vector comprising the heterologous nucleic acid sequence, wherein the
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the
heterologous nucleic acid sequence.

Also included is a method of expressing, in a host cell, a heterologous nucleic acid 
sequence which encodes for a protein which modifies lycopene ε-cyclase, IPP isomerase or
β-carotene hydroxylase enzyme activity in the host cell, relative to an untransformed host
cell, the method comprising inserting into the host cell a vector comprising the heterologous
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is a method for screening for genes and 
cDNAs which encode enzymes involved in carotenoid biosynthesis and metabolism. 

BRIEF DESCRIPTION OF THE DRAWINGS

A more complete appreciation of the invention and many of the attendant advantages 
thereof will be readily obtained as the same becomes better understood by reference to the 
following detailed description when considered in connection with the accompanying 
drawings, wherein: 

Figure 1 is a schematic representation of the putative pathway of β-carotene
biosynthesis in cyanobacteria, algae and plants. The enzymes catalyzing various steps are
indicated at the left. Target sites of the bleaching herbicides NFZ and MPTA are also
indicated at the left. Abbreviations: DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl
pyrophosphate; GGPP, geranylgeranyl pyrophosphate; GPP, geranyl pyrophosphate; IPP,
isopentenyl pyrophosphate; LCY, lycopene cyclase; MVA, mevalonic acid; MPTA,
2-(4-methylphenoxy)triethylamine hydrochloride; NFZ, norflurazon; PDS, phytoene desaturase;
PSY, phytoene synthase; ZDS, ζ-carotene desaturase; PPPP, prephytoene pyrophosphate.

Figure 2 depicts possible routes of synthesis of cyclic carotenoids and common plant
and algal xanthophylls (oxycarotenoids) from neurosporene. Demonstrated activities of the
β- and ε-cyclase enzymes of A. thaliana are indicated by bold arrows labelled with β or ε
respectively. A bar below the arrow leading to ε-carotene indicates that the enzymatic
activity was examined but no product was detected. The steps marked by an arrow with a
dotted line have not been specifically examined. Conventional numbering of the carbon
atoms is given for neurosporene and α-carotene. Inverted triangles (▼) mark positions of the
double bonds introduced as a consequence of the desaturation reactions.

Figure 3 depicts the carotene endgroups which are found in plants.

Figure 4 is a DNA sequence and the predicted amino acid sequence of a lycopene
ε-cyclase cDNA isolated from A. thaliana (SEQ ID NOS: 1 and 2). These sequences were
deposited under GenBank accession number U50738. This cDNA is incorporated into the
plasmid pATeps.

Figure 5 is a DNA sequence encoding the β-carotene hydroxylase isolated from A.
thaliana (SEQ ID NO: 3). This cDNA is incorporated into the plasmid pATOHB.

Figure 6 is an alignment of the predicted amino acid sequences of A. thaliana
β-carotene hydroxylase (SEQ ID NO: 4) with those of the bacterial β-carotene hydroxylase
enzymes from Alcaligenes sp. (SEQ ID NO: 5) (GenBank D58422), Erwinia herbicola Eho10
(SEQ ID NO: 6) (GenBank M872280), Erwinia uredovora (SEQ ID NO: 7) (GenBank
D90087) and Agrobacterium aurantiacum (SEQ ID NO: 8) (GenBank D58420). A
consensus sequence is also shown. All five genes are identical where a capital letter appears
in the consensus. A lowercase letter indicates that three of five, including A. thaliana, have
the identical residue. TM, transmembrane.

Figure 7 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from A.
thaliana (SEQ ID NO: 9). This cDNA is incorporated into the plasmid pATDP5.

Figure 8 is a DNA sequence of a second cDNA encoding another IPP isomerase
isolated from A. thaliana (SEQ ID NO: 10). This cDNA is incorporated into the plasmid
pATDP7.

Figure 9 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from
Haematococcus pluvialis (SEQ ID NO: 11). This cDNA is incorporated into the plasmid
pHP04.

Figure 10 is a DNA sequence of a second cDNA encoding another IPP isomerase
isolated from Haematococcus pluvialis (SEQ ID NO: 12). This cDNA is incorporated into
the plasmid pHP05.

Figure 11 is an alignment of the amino acid sequences predicted by IPP isomerase
cDNAs isolated from A. thaliana (SEQ ID NOS: 16 and 18), H. pluvialis (SEQ ID NOS: 14
and 15), Clarkia breweri (SEQ ID NO: 17) (see Blanc & Pichersky, Plant Physiol. (1995)
108:855; GenBank accession no. X82627) and Saccharomyces cerevisiae (SEQ ID NO: 19)
(GenBank accession no. J05090).

Figure 12 is a DNA sequence of the cDNA encoding an IPP isomerase isolated from
Tagetes erecta (marigold; SEQ ID NO: 13). This cDNA is incorporated into the plasmid
pPMDP1. The x's denote a region not originally sequenced. Figure 21A shows the complete
marigold sequence.

Figure 13 is an alignment of the consensus sequence of four plant β-cyclases (SEQ ID
NO: 20) with the A. thaliana lycopene ε-cyclase (SEQ ID NO: 21). A capital letter in the
plant β consensus is used where all four β-cyclase genes predict the same amino acid residue
in this position. A small letter indicates that an identical residue was found in three of the
four. Dashes indicate that the amino acid residue was not conserved and dots in the sequence
denote a gap. A consensus for the aligned sequences is given, in capital letters below the
alignment, where the β- and ε-cyclases have the same amino acid residue. Arrows indicate
some of the conserved amino acids that will be used as junction sites for construction of
chimeric cyclases with novel enzymatic activities. Several regions of interest, including a
sequence signature indicative of a dinucleotide-binding motif and two predicted
transmembrane (TM) helical regions, are indicated below the alignment and are underlined.

Figure 14 shows the nucleotide (SEQ ID NO:22) and amino acid sequences (SEQ ID
NO:23) of the Adonis palaestina (pheasant's eye) ε-cyclase cDNA #5.

Figure 15A shows the nucleotide (SEQ ID NO:24) and amino acid sequences (SEQ
ID NO:25) of a potato ε-cyclase cDNA. Figure 15B shows the amino acid sequence (SEQ ID
NO:26) of a chimeric lettuce/potato lycopene ε-cyclase. Amino acids in lower case are from
the lettuce cDNA and those in upper case are from the potato cDNA. The product of this
chimeric cDNA has ε-cyclase activity and converts lycopene to the monocyclic δ-carotene.

Figure 16 shows a comparison between the amino acid sequences of the Arabidopsis
ε-cyclase (SEQ ID NO:27) and the potato ε-cyclase (SEQ ID NO:25).

Figure 17A shows the nucleotide sequence of the Adonis palaestina Ipi1 (SEQ ID
NO:28) and Figure 17B shows the nucleotide sequence of the Adonis palaestina Ipi2 (SEQ
ID NO:29).

Figure 18A shows the nucleotide sequence of the Haematococcus pluvialis Ipi1 (SEQ
ID NO:11) and Figure 18B shows the nucleotide sequence of the Haematococcus pluvialis Ipi2
(SEQ ID NO:30).

Figure 19A shows the nucleotide sequence of the Lactuca sativa (romaine lettuce)
Ipi1 (SEQ ID NO:31) and Figure 19B shows the nucleotide sequence of the Lactuca sativa
Ipi2 (SEQ ID NO:32).

Figure 20 shows the nucleotide sequence of the Chlamydomonas reinhardtii Ipi1
(SEQ ID NO:33).

Figure 21A shows the nucleotide sequence of the Tagetes erecta (marigold) Ipi1 (SEQ
ID NO:34) and Figure 21B shows the nucleotide sequence of the Oryza sativa (rice) Ipi1
(SEQ ID NO:35).

Figure 22 shows an amino acid sequence alignment of various plant and green algal
isopentenyl isomerases (IPI) (SEQ ID NOS:16, 36-45).

Figure 23 shows a comparison between Adonis palaestina ε-cyclase cDNA #3 and
cDNA #5 nucleotide sequences.

Figure 24 shows a comparison between Adonis palaestina ε-cyclase cDNA #3 and
cDNA #5 predicted amino acid sequences.

Figure 25 shows a sequence alignment of various plant β- and ε-cyclases. Those
sequences outlined in grey denote identical sequences among the ε-cyclases. Those
sequences outlined in black denote identical sequences among both the β- and ε-cyclases.

Figure 26 shows a sequence alignment of the plant ε-cyclases from Figure 25. Those
sequences outlined in black denote identical sequences among the ε-cyclases.

Figure 27 is a dendrogram or "tree" illustrating the degree of amino acid sequence
similarity for various lycopene β- and ε-cyclases.

Figure 28 shows a comparison between Arabidopsis ε-cyclase and lettuce ε-cyclase
predicted amino acid sequences.

DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention includes an isolated and/or purified nucleic acid sequence
which encodes for a protein having lycopene ε-cyclase, IPP isomerase or β-carotene
hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4,
14-21, 23 or 25-27. Nucleic acids encoding lycopene ε-cyclase, β-carotene hydroxylase and IPP
isomerases have been isolated from several genetically distant sources.

The present inventors have isolated nucleic acids encoding the enzyme IPP isomerase,
which catalyzes the reversible conversion of isopentenyl pyrophosphate (IPP) to
dimethylallyl pyrophosphate (DMAPP). IPP isomerase cDNAs were isolated from the plants
A. thaliana, Tagetes erecta (marigold), Adonis palaestina (pheasant's eye), Lactuca sativa
(romaine lettuce) and from the green algae H. pluvialis and Chlamydomonas reinhardtii.
Alignments of the amino acid sequences predicted by some of these cDNAs are shown in
Figures 12 and 22. Plasmids containing some of these cDNAs were deposited with the
American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on March 4,
1996 under ATCC accession numbers 98000 (pHP05 - H. pluvialis); 98001 (pMDP1 -
marigold); 98002 (pATDP7 - A. thaliana) and 98004 (pHP04 - H. pluvialis).

The present inventors have also isolated nucleic acids encoding the enzyme
β-carotene hydroxylase, which is responsible for hydroxylating the β-endgroup in carotenoids.
The nucleic acid of the present invention is shown in SEQ ID NO: 3 and Figure 5. The
full-length cDNA product hydroxylates both end groups of β-carotene, as do products of cDNAs
which encode proteins truncated by up to 50 amino acids from the N-terminus. Products of
genes which encode proteins truncated between about 60-110 amino acids from the
N-terminus preferentially hydroxylate only one ring. A plasmid containing this gene was
deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD
20852 on March 4, 1996 under ATCC accession number 98003 (pATOHB - A. thaliana).

The present inventors have also isolated nucleic acids encoding the enzyme lycopene
ε-cyclase, which is responsible for the formation of ε-endgroups in carotenoids. The A.
thaliana ε-cyclase adds an ε ring to only one end of the symmetrical lycopene while the
related β-cyclase adds a ring at both ends. The A. thaliana cDNA of the present invention is
shown in Figure 4 and SEQ ID NO: 1. A plasmid containing this gene was deposited with
the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on
March 4, 1996 under ATCC accession number 98005 (pATeps - A. thaliana).

In addition, lycopene ε-cyclases have been identified in lettuce and in Adonis
palaestina (cDNA #5) which encode enzymes that convert lycopene to the bicyclic
ε-carotene (ε,ε-carotene). An additional cDNA from Adonis palaestina (cDNA #3) encodes a
lycopene ε-cyclase which converts lycopene into δ-carotene (ε,ψ-carotene) and differs from
the lycopene ε-cyclase which forms bicyclic ε-carotene (ε,ε-carotene) by only 5 amino acids.
One or more of these amino acids may be modified by alteration of the nucleotide sequence
in the #5 cDNA to obtain an enzyme which forms the bicyclic ε,ε-carotene. The sequences
of the Adonis palaestina and Arabidopsis thaliana ε-cyclases have about 70% nucleotide
identity and about 72% amino acid identity.

Initial experiments by the inventors with chimeric genes indicated that the part of the
ε-cyclase which is responsible for adding 2 ε rings to form ε,ε-carotene is the carboxy
terminal portion of the gene. The lettuce ε-cyclase adds two ε rings to form ε,ε-carotene. A
DNA encoding a partial potato ε-cyclase (missing its amino terminal portion), when
combined with an amino terminal region from the lettuce ε-cyclase gene, produces a
monocyclic δ-carotene (ε,ψ-carotene). With the discovery of the differences between the
Adonis palaestina clone #3 and clone #5, the specific amino acids responsible for the addition
of an extra ε ring have been identified (Figure 24). Specifically, amino acid 55 is Thr in
clone #3 and Ser in clone #5, amino acid 210 is Asn in clone #3 and Asp in clone #5, amino
acid 231 is Asp in clone #3 and Glu in clone #5, amino acid 352 is Ile in clone #3 and Val in
clone #5, and amino acid 524 is Lys in clone #3 and Arg in clone #5. It can be appreciated
that these changes are quite conservative, as only one change, at amino acid 210, changes the
charge of the protein.
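
By way of illustration only, the observation that a single one of the five differences alters side-chain charge can be checked with a short Python sketch. The charge table is a simplifying assumption (Asp and Glu treated as negative, Lys, Arg and His as positive, all other residues as uncharged), and the names CHARGE, DIFFERENCES and charge_changing are hypothetical and form no part of the disclosure.

```python
# Side-chain charge classes (assumption: acidic = -1, basic = +1, all others 0).
CHARGE = {"Asp": -1, "Glu": -1, "Lys": +1, "Arg": +1, "His": +1}

# (position, residue in clone #3, residue in clone #5), as recited above.
DIFFERENCES = [
    (55, "Thr", "Ser"),
    (210, "Asn", "Asp"),
    (231, "Asp", "Glu"),
    (352, "Ile", "Val"),
    (524, "Lys", "Arg"),
]


def charge_changing(differences):
    """Return the positions at which the two residues differ in charge class."""
    return [
        pos
        for pos, res3, res5 in differences
        if CHARGE.get(res3, 0) != CHARGE.get(res5, 0)
    ]


print(charge_changing(DIFFERENCES))  # prints [210]
```

Under these assumptions only position 210 (Asn to Asp) is flagged; the other four substitutions stay within the same charge class.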

Thus, it is clear that the nucleic acids of the invention encoding the enzymes as 
presently disclosed may be altered to increase a particularly desirable property of the enzyme, 
to change a property of the enzyme, or to diminish an undesirable property of the enzyme. 
Such modifications can be by deletion, substitution, or insertion of one or more amino acids, 
and can be performed by routine enzymatic manipulation of the nucleic acid encoding the 
enzyme (such as by restriction enzyme digestion, removal of nucleotides by mung bean 
nuclease or Bal31, insertion of nucleotides by Klenow fragment, and by religation of the
ends), by site-directed mutagenesis, or may be accidental, such as by low fidelity PCR or 
those obtained through mutations in hosts that are producers of the enzymes. These 
techniques as well as other suitable techniques are well known in the art. 

Mutations can be made in the nucleic acids of the invention such that a particular 
codon is changed to a codon which codes for a different amino acid. Such a mutation is 
generally made by making the fewest nucleotide changes possible. A substitution mutation 
of this sort can be made to change an amino acid in the resulting protein in a
non-conservative manner (i.e., by changing the codon from an amino acid belonging to a grouping
of amino acids having a particular size or characteristic to an amino acid belonging to another
grouping) or in a conservative manner (i.e., by changing the codon from an amino acid 
belonging to a grouping of amino acids having a particular size or characteristic to an amino 
acid belonging to the same grouping). Such a conservative change generally leads to less 
change in the structure and function of the resulting protein. A non-conservative change is 
more likely to alter the structure, activity or function of the resulting protein. The present 
invention should be considered to include sequences containing conservative changes which 
do not significantly alter the activity or binding characteristics of the resulting protein. 
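
As a non-limiting sketch of how a codon change requiring the fewest nucleotide changes might be chosen, the following Python fragment searches the standard genetic code for the codons of a target amino acid that lie closest to a starting codon. The standard code is assumed (a host using a different code would need a different table), and the function name closest_codons is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def closest_codons(codon, target_aa):
    """Return (n, codons): the minimum number of nucleotide changes needed to
    turn `codon` into a codon for `target_aa`, and the codons achieving it."""

    def distance(other):
        return sum(x != y for x, y in zip(codon, other))

    candidates = [c for c, aa in CODON_TABLE.items() if aa == target_aa]
    best = min(distance(c) for c in candidates)
    return best, sorted(c for c in candidates if distance(c) == best)


# Asn (codon AAT) to Asp: a single A-to-G change at the first position suffices.
print(closest_codons("AAT", "D"))  # prints (1, ['GAT'])
```

The one-letter amino acid codes are used for the target residue; a stop codon is represented by "*" in this table.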

The following is one example of various groupings of amino acids:

Amino acids with nonpolar R groups: Alanine, Valine, Leucine, Isoleucine, Proline,
Phenylalanine, Tryptophan and Methionine.

Amino acids with uncharged polar R groups: Glycine, Serine, Threonine, Cysteine, Tyrosine,
Asparagine and Glutamine.

Amino acids with charged polar R groups (negatively charged at pH 6.0): Aspartic acid and
Glutamic acid.

Basic amino acids (positively charged at pH 6.0): Lysine, Arginine and Histidine.

Another grouping may be those amino acids with phenyl groups: Phenylalanine,
Tryptophan and Tyrosine.

Another grouping may be according to molecular weight (i.e., size of R groups). 
Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gln for Asn such that a free NH2 can be maintained.
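
Purely as an illustration, the four R-group groupings given above can be encoded so that a proposed substitution is classified as conservative (same grouping) or non-conservative (different grouping). The sketch below assumes that single set of groupings; the phenyl-group or molecular-weight groupings would give different answers for some pairs. The names GROUPS, group_of and is_conservative are hypothetical.

```python
# One possible grouping of amino acids by R group, as listed in the description.
GROUPS = {
    "nonpolar": {"Ala", "Val", "Leu", "Ile", "Pro", "Phe", "Trp", "Met"},
    "uncharged polar": {"Gly", "Ser", "Thr", "Cys", "Tyr", "Asn", "Gln"},
    "acidic": {"Asp", "Glu"},
    "basic": {"Lys", "Arg", "His"},
}


def group_of(residue):
    """Return the name of the grouping that contains the given residue."""
    for name, members in GROUPS.items():
        if residue in members:
            return name
    raise ValueError(f"unknown residue: {residue}")


def is_conservative(original, replacement):
    """A substitution is treated as conservative if both residues share a grouping."""
    return group_of(original) == group_of(replacement)


for pair in [("Lys", "Arg"), ("Glu", "Asp"), ("Ser", "Thr"), ("Gln", "Asn"), ("Asn", "Asp")]:
    print(pair, is_conservative(*pair))
```

Each of the four preferred substitutions falls within a single grouping, whereas Asn to Asp crosses from the uncharged polar grouping to the acidic grouping.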

Amino acid substitutions may also be introduced to substitute an amino acid with a 
particularly preferable property. For example, a Cys may be introduced to provide a potential 
site for disulfide bridges with another Cys. A His may be introduced as a particularly 
"catalytic" site (i.e., His can act as an acid or base and is the most common amino acid in 
biochemical catalysis). Pro may be introduced because of its particularly planar structure, 
which induces β-turns in the protein's structure.

It is clear that certain modifications of SEQ ID NOS: 2, 4, 14-21, 23 or 25-27 can take
place without destroying the activity of the enzyme. It is noted especially that truncated
versions of the nucleic acids of the invention are functional. For example, several amino
acids (from 1 to about 120) can be deleted from the N-terminus of the lycopene ε-cyclases of
the invention, and a functional protein can still be produced. This fact is made especially
clear from Figure 25, which shows a sequence alignment of several plant ε-cyclases. As can
be seen from Figure 25, there is an enormous amount of sequence disparity between amino
acid sequences 2 to about 50-70 (depending on the particular sequence, since gaps are
present). There is less, but also a substantial amount of, sequence dissimilarity between about
50-70 to about 90-120 (depending on the particular sequence). Thereafter, the sequences are
fairly conserved, except for small pockets of dissimilarity between about 275-295 to about
285-305 (depending on the particular sequence), and between about 395-415 to about
410-430 (depending on the particular sequence).

The present inventors have found that the amount of the 5' region present in the 
nucleic acids of the invention can alter the activity of the enzyme. Instead of diminishing 
activity, truncating the 5' region of the nucleic acids of the invention may result in an enzyme
with a different specificity. Thus, the present invention relates to nucleic acids and enzymes 
encoded thereby which are truncated to within 0-50, preferably 0-25, codons of the 5' 
initiation codon of their prokaryotic counterparts as determined by alignment maps as 
discussed below. 

For example, when the cDNA encoding A. thaliana β-carotene hydroxylase was
truncated, the resulting enzyme catalyzed the formation of β-cryptoxanthin as the major
product and zeaxanthin as a minor product, in contrast to its normal production of zeaxanthin.

The present invention is intended to include those nucleic acid and amino acid 
sequences in which substitutions, deletions, additions or other modifications have taken 
place, as compared to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27, without destroying the activity 
of the enzyme. Preferably, the substitutions, deletions, additions or other modifications take 
place at the 5' end, or any other of those positions which already show dissimilarity between 
any of the presently disclosed amino acid sequences (see also Figure 25) or other amino acid 
sequences which are known in the art and which encode the same enzyme (i.e., lycopene
ε-cyclase, IPP isomerase or β-carotene hydroxylase).

In each case, nucleic acid and amino acid sequence similarity and identity are measured
using sequence analysis software, for example, the Sequence Analysis, Gap, or BestFit
software packages of the Genetics Computer Group (University of Wisconsin Biotechnology
Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAlign (DNAStar, Inc.,
1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105
S. Bascom Avenue, Suite 200, Campbell, California 95008). Such software uses algorithms
to match similar sequences by assigning degrees of identity to various substitutions,
deletions, and other modifications, and includes detailed instructions as to useful parameters,
etc., such that those of routine skill in the art can easily compare sequence similarities and
identities. An example of a useful algorithm in this regard is the algorithm of Needleman and
Wunsch, which is used in the Gap program discussed above. This program finds the
alignment of two complete sequences that maximizes the number of matches and minimizes
the number of gaps. Another useful algorithm is the algorithm of Smith and Waterman,
which is used in the BestFit program discussed above. This program creates an optimal
alignment of the best segment of similarity between two sequences. Optimal alignments are
found by inserting gaps to maximize the number of matches using the local homology
algorithm of Smith and Waterman.
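
The global alignment approach of Needleman and Wunsch may be sketched, for illustration only, as the following dynamic-programming routine in Python. The scoring values (match +1, mismatch -1, gap -1) and the choice to report identity over the full alignment length are assumptions made for this sketch and are not the parameters of the Gap program or of any other package named above; the function names are hypothetical.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align sequences a and b; return (score, aligned_a, aligned_b)."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    for i in range(1, rows):
        score[i][0] = i * gap          # a[:i] aligned against gaps only
    for j in range(1, cols):
        score[0][j] = j * gap          # b[:j] aligned against gaps only

    def pair(i, j):
        return match if a[i - 1] == b[j - 1] else mismatch

    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                score[i - 1][j - 1] + pair(i, j),  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,             # gap in b
                score[i][j - 1] + gap,             # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = len(a), len(b)
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair(i, j):
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[-1][-1], "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Identical, non-gap columns as a percentage of the alignment length."""
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)


score, top, bottom = needleman_wunsch("ACGT", "ACT")
print(score, top, bottom, percent_identity(top, bottom))  # prints 2 ACGT AC-T 75.0
```

A local alignment in the manner of Smith and Waterman differs chiefly in flooring each cell at zero and tracing back from the highest-scoring cell rather than from the corner.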

Conservative (i.e. similar) substitutions typically include substitutions within the 
following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, glutamic 
acid, asparagine and glutamine; serine and threonine; lysine and arginine; and phenylalanine 
and tyrosine. Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 (1982)), or on the basis of 
the ability to assume similar polypeptide secondary structure (see Chou and Fasman, Adv. 
Enzymol. 47: 45-148 (1978)). 
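
The substitution groups listed above can be expressed as a simple lookup, sketched below using standard one-letter amino acid codes. The names `CONSERVATIVE_GROUPS` and `is_conservative` are hypothetical and are introduced only for illustration; the sketch covers the six listed groups and does not attempt the hydropathy-based or secondary-structure-based criteria.

```python
# One-letter codes for the six groups named in the text:
# Gly/Ala; Val/Ile/Leu; Asp/Glu/Asn/Gln; Ser/Thr; Lys/Arg; Phe/Tyr.
CONSERVATIVE_GROUPS = [
    set("GA"),
    set("VIL"),
    set("DENQ"),
    set("ST"),
    set("KR"),
    set("FY"),
]


def is_conservative(original, replacement):
    """True if both residues differ and fall within one of the listed groups."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("K", "R"))  # lysine to arginine
    print(is_conservative("G", "W"))  # glycine to tryptophan
```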

If comparison is made between nucleotide sequences, preferably the length of 
comparison sequences is at least 50 nucleotides, more preferably at least 60 nucleotides, at 
least 75 nucleotides or at least 100 nucleotides. It is most preferred if comparison is made 
between the nucleic acid sequences encoding the enzyme coding regions necessary for 
enzyme activity. If comparison is made between amino acid sequences, preferably the length 
of comparison is at least 20 amino acids, more preferably at least 30 amino acids, at least 40 
amino acids or at least 50 amino acids. It is most preferred if comparison is made between 
the amino acid sequences in the enzyme coding regions necessary for enzyme activity. 

It should be appreciated that also within the scope of the present invention are nucleic 
acid sequences encoding lycopene ε-cyclases, IPP isomerases and β-carotene hydroxylases 
which code for enzymes having the same amino acid sequence as SEQ ID NOS: 2, 4, 14-21, 
23 or 25-27, but which are degenerate to the nucleic acids specifically disclosed herein. 
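
Degeneracy of this kind can be checked by translating two coding sequences and comparing the resulting proteins, as in the following sketch. The standard genetic code is assumed. The first test sequence is the first 18 nucleotides of the open reading frame shown in Fig. 4A; the second is a hypothetical synonymous variant made up for this example, and the function names are likewise illustrative.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence in frame 1; trailing partial codons are ignored."""
    dna = dna.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encode_same_protein(seq_a, seq_b):
    """True if two nucleic acid sequences translate to the same amino acid sequence."""
    return translate(seq_a) == translate(seq_b)


if __name__ == "__main__":
    disclosed = "atggagtgtgttggggct"   # start of the ORF in Fig. 4A
    variant = "atggaatgcgtcggagcc"     # hypothetical degenerate variant
    print(translate(disclosed))
    print(encode_same_protein(disclosed, variant))
```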

The amino acid residues described herein are preferred to be in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino acid residue, 
as long as the desired enzymatic activity is retained by the 
polypeptide. 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the 
art. Such techniques are explained fully in the literature. See, e.g., Sambrook et al., 
"Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular 
Biology" Volumes I-III [Ausubel, F. M., ed. (1994)]; "Cell Biology: A Laboratory 
Handbook" Volumes I-III [J. E. Celis, ed. (1994)]; "Current Protocols in Immunology" 
Volumes I-III [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); 
"Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription And 
Translation" [B.D. Hames & S.J. Higgins, eds. (1984)]; "Animal Cell Culture" [R.I. 
Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A 
Practical Guide To Molecular Cloning" (1984). 

The present invention also includes vectors. Suitable vectors according to the present 
invention comprise a nucleic acid of the invention encoding an enzyme involved in 
carotenoid biosynthesis or metabolism and a suitable promoter for the host, and can be 
constructed using techniques well known in the art (for example Sambrook et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 
1989; Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing and Wiley 
Interscience, New York, 1991). Suitable vectors for eukaryotic expression in plants are 
described in Frey et al., Plant J. (1995) 8(5):693 and Misawa et al., 1994a; incorporated herein 
by reference. Suitable vectors for prokaryotic expression include pACYC184, pUC119, and 
pBR322 (available from New England BioLabs, Beverly, MA) and pTrcHis (Invitrogen) and 
pET28 (Novagen) and derivatives thereof. The vectors of the present invention can 
additionally contain regulatory elements such as promoters, repressors, selectable markers 
such as antibiotic resistance genes, etc. 

The nucleic acids encoding the carotenoid enzymes as described above, when cloned 
into a suitable expression vector, can be used to overexpress these enzymes in a plant 
expression system or to inhibit the expression of these enzymes. For example, a vector 
containing the gene encoding lycopene ε-cyclase can be used to increase the amount of α-carotene 
and carotenoids derived from α-carotene (such as lutein and α-cryptoxanthin) in an 
organism and thereby alter the nutritional value, pharmacology and visual appearance value 
of the organism. 

Therefore, the present invention includes a method of producing or enhancing the 
production of a carotenoid in a host cell, relative to an untransformed host cell, the method 
comprising inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having lycopene ε-cyclase, IPP isomerase or β-carotene 
hydroxylase enzyme activity, wherein the heterologous nucleic acid sequence is 
operably linked to a promoter; and expressing the heterologous nucleic acid sequence to 
produce the protein. 

The present invention also includes a method of modifying the production of 
carotenoids in a host cell, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for 
a protein which modifies lycopene ε-cyclase, IPP isomerase or β-carotene hydroxylase 
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid 
sequence in the host cell to modify the production of the carotenoids in the host cell, relative 
to the untransformed host cell. 

The term "modifying the production" means that the amount of carotenoids produced 
in the host cell can be enhanced, reduced, or left the same, as compared to the untransformed 
host cell. In accordance with one embodiment of the present invention, the make-up of the 
carotenoids (i.e., the specific carotenoids produced) is changed vis a vis each other, and this 
change in make-up may result in either a net gain, net loss, or no net change in the total 
amount of carotenoids produced in the cell. In accordance with another embodiment of the 
present invention, the production or the biochemical activity of the carotenoids (or the 
enzymes which catalyze their formation) is enhanced by the insertion of an enzyme-encoding 
nucleic acid of the invention. In yet another embodiment of the invention, the production or 
the biochemical activity of the carotenoids (or the enzymes which catalyze their formation) 
may be reduced or inhibited by a number of different approaches available to those skilled in 
the art, including but not limited to such methodologies or approaches as anti-sense (e.g., 
Gray et al. (1992) Plant Mol. Biol. 19:69-87), ribozymes (e.g., Wegener et al. (1994) Mol. 
Gen. Genet. 245:465-470), co-suppression (e.g., Fray and Grierson (1993) Plant Mol. Biol. 
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. (1997) Plant J. 11:1195-1206), 
intracellular antibodies (e.g., Rondon and Marasco (1997) Ann. Rev. Microbiol. 
51:257-283) or whatever other approaches rely on the knowledge or availability of the nucleic 
acid or amino acid sequences of the invention and/or portions thereof, to thereby reduce 
accumulation of carotenoids with ε rings and compounds derived from them (for ε-cyclase 
inhibition), or carotenoids with hydroxylated β rings and compounds derived from them (for 
β-hydroxylase inhibition), or, in the case of IPP isomerase, accumulation of any isoprenoid 
compound. 

Preferably, at least a portion of the nucleic acid sequences used in the methods, 
vectors and host cells of the invention codes for an enzyme having an amino acid sequence 
which is at least 85% identical, preferably at least 90%, at least 95% or completely identical 
to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27. Sequence identity is determined as noted above. 
Preferably, sequence additions, deletions or other modifications are made as indicated above, 
so as to not affect the function of the particular enzyme. 

In a preferred embodiment, vectors are manufactured which contain a DNA encoding 
a eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic carotenoid 
enzyme. The inventors have discovered that inclusion of an IPP isomerase gene increases the 
supply of substrate for the carotenoid pathway, thereby enhancing the production of 
carotenoid endproducts, as compared to a host cell which is not transformed with such a 
vector. This is apparent from the much deeper pigmentation in carotenoid-accumulating 
colonies of E. coli which also contain one of the aforementioned IPP isomerase genes when 
compared to colonies that lack this additional IPP isomerase gene. Similarly, a vector 
comprising an IPP isomerase gene can be used to enhance production of any secondary 
metabolite of dimethylallyl pyrophosphate and/or isopentenyl pyrophosphate (such as 
isoprenoids, steroids, carotenoids, etc.). The term "isoprenoid" is intended to mean any 
member of the class of naturally occurring compounds whose carbon skeletons are composed, 
in part or entirely, of isopentyl C5 units. Preferably, the carbon skeleton is of an essential oil, 
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel. 

A vector containing the cDNA encoding a lycopene ε-cyclase of the invention, 
preferably the lettuce lycopene ε-cyclase or Adonis ε-cyclase #5, can be used to increase the 
amount of bicyclic ε-carotene in an organism and thereby alter the nutritional value, 
pharmacology and visual appearance value of the organism. In addition, the transformed 
organism can be used in the formulation of therapeutic agents, for example in the treatment of 
cancer (see Mayne et al. (1996) FASEB J. 10:690-701; Tsushima et al. (1995) Biol. Pharm. 
Bull. 18:227-233). 

An antisense strand of a nucleic acid of the invention can be inserted into a vector. 
For example, the lycopene ε-cyclase gene can be inserted into a vector in the antisense 
orientation and incorporated into the genomic DNA of a host, thereby inhibiting the synthesis 
of ε,β-carotenoids (lutein and α-carotene) and enhancing the synthesis of β,β-carotenoids 
(zeaxanthin and β-carotene). 
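
For illustration, the antisense strand of a coding sequence is its reverse complement, which can be derived as in the following sketch. The function name is hypothetical, and the input shown is merely the first nine nucleotides of the open reading frame in Fig. 4A; a practical antisense construct would use a far longer segment placed behind a promoter active in the host.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the antisense (reverse-complement) strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    sense = "ATGGAGTGT"
    print(reverse_complement(sense))
```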

The present invention also relates to novel enzymes which are encoded by the amino 
acid sequences of the invention, or portions thereof. 

The present invention also relates to novel enzymes which can transform known 
carotenoids into novel or uncommon products. Currently ε-carotene (see Figure 2) and γ-carotene 
are commonly produced only in minor amounts. As described below, an enzyme 
can be produced which transforms lycopene to γ-carotene and lycopene to ε-carotene. With 
these products in hand, bulk synthesis of other carotenoids derived from them is possible. 
For example, ε-carotene can be hydroxylated to form lactucaxanthin, an isomer of lutein (one 
ε and one β ring) and zeaxanthin (two β rings) where both endgroups are, instead, ε rings. 

In addition to novel enzymes produced by truncating the 5' region of known enzymes, 
as discussed above, novel enzymes which can participate in the formation of unusual 
carotenoids can be formed by replacing portions of one gene with an analogous sequence 
from a structurally related gene. For example, β-cyclase and ε-cyclase are structurally 
related (see Figure 13). By replacing a portion of the lycopene β-cyclase with the analogous 
portion of ε-cyclase, an enzyme which produces γ-carotene will be produced (one β 
endgroup). Further, by replacing a portion of the lycopene ε-cyclase with the analogous 
portion of β-cyclase, an enzyme which produces ε-carotene will be produced (with some 
exceptions, such as the lettuce ε-cyclase, plant ε-cyclases normally produce a compound with 
one ε-endgroup, δ-carotene). Similarly, β-hydroxylase could be modified to produce 
enzymes of novel function by creation of hybrids with ε-hydroxylase. 

Host systems according to the present invention can comprise any organism that 
already produces carotenoids or which has been genetically modified to produce carotenoids. 
The IPP isomerase genes are more broadly applicable for enhancing production of any 
product dependent on DMAPP and/or IPP as a precursor. 

Organisms which already produce carotenoids include plants, algae, some yeasts, 
fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present invention can be done using standard techniques such as 
those described in Misawa et al., (1990) supra; Hundle et al., (1993) supra; Hundle et al., 
(1991) supra; Misawa et al., (1991) supra; Sandmann et al., supra; and Schnurr et al., supra. 

Transgenic organisms can be constructed which include the nucleic acid sequences of 
the present invention (Bird et al., 1991; Bramley et al., 1992; Misawa et al., 1994a; Misawa et 
al., 1994b; Cunningham et al., 1993). The incorporation of these sequences can allow the 
controlling of carotenoid biosynthesis, content, or composition in the host cell. These 
transgenic systems can be constructed to incorporate sequences which allow for the 
overexpression of the nucleic acids of the present invention. Transgenic systems can also be 
constructed containing antisense expression of the nucleic acid sequences of the present 
invention. Such antisense expression would result in the accumulation of the substrates of the 
enzyme encoded by the sense strand. 

A method for screening for eukaryotic genes which encode enzymes involved in 
carotenoid biosynthesis comprises transforming a prokaryotic host with a nucleic acid which 
may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; culturing said 
transformed host to obtain colonies; and screening for colonies exhibiting a different color 
than colonies of the untransformed host. 

Suitable hosts include E. coli, cyanobacteria such as Synechococcus and 
Synechocystis, algae and plant cells. E. coli is preferred. 

In a preferred embodiment, the above "color complementation" screening protocol can 
be enhanced by using mutants which are either (1) deficient in at least one carotenoid 
biosynthetic gene or (2) overexpress at least one carotenoid biosynthetic gene. In either case, 
such mutants will accumulate carotenoid precursors. 

Prokaryotic and eukaryotic DNA or cDNA libraries can be screened in total for the 
presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred 
organisms to be screened include photosynthetic organisms. 

E. coli can be transformed with these eukaryotic cDNA libraries using conventional 
methods such as those described in Sambrook et al., 1989 and according to protocols 
described by the vendors of the cloning vectors. 

For example, the cDNA libraries in bacteriophage vectors such as lambdaZAP 
(Stratagene) or lambda ZIPLOX (Gibco BRL) can be excised en masse and used to transform 
E. coli. 

Transformed E. coli can be cultured using conventional techniques. The culture broth 
preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics include 
penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 15-40°C, 
preferably at room temperature or slightly above (18-28°C), for 12 hours to 7 days. 

Cultures are plated and the plates are screened visually for colonies with a different 
color than the colonies of the host E. coli transformed with the empty plasmid cloning vector. 
For example, E. coli transformed with the plasmid pAC-BETA (described below) produce 
yellow colonies that accumulate β-carotene. After transformation with a cDNA library, 
colonies which have a different hue than those formed by E. coli/pAC-BETA would be 
expected to contain enzymes which modify the structure or accumulation of β-carotene. 
Similar E. coli strains can be engineered which accumulate earlier products in carotenoid 
biosynthesis, such as lycopene, γ-carotene, etc. 
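
The decision rule of the color complementation screen can be sketched as follows, purely as an illustration. The baseline colony colors are those reported in the Examples for the host plasmids pAC-BETA, pAC-BETA-04 and pAC-LYC; the dictionary and function names are hypothetical, and in practice color is judged by eye and confirmed by pigment analysis rather than recorded as a text label.

```python
# Background colony color reported for each pigment-accumulating host plasmid.
BASELINE_COLOR = {
    "pAC-BETA": "yellow",     # accumulates beta-carotene
    "pAC-BETA-04": "orange",  # pAC-BETA plus IPP isomerase
    "pAC-LYC": "pink",        # accumulates lycopene
}


def candidate_colonies(host_plasmid, colony_colors):
    """Return indices of colonies whose color differs from the host background."""
    baseline = BASELINE_COLOR[host_plasmid]
    return [i for i, color in enumerate(colony_colors) if color != baseline]


if __name__ == "__main__":
    plate = ["orange", "orange", "yellow", "orange"]
    print(candidate_colonies("pAC-BETA-04", plate))
```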

Having generally described this invention, a further understanding can be obtained by 
reference to certain specific examples which are provided herein for purposes of illustration 
only and are not intended to be limiting unless otherwise specified. 

EXAMPLES 

I. Isolation of β-carotene hydroxylase 
Plasmid Construction 

An 8.6 kb BglII fragment containing the carotenoid biosynthetic genes of Erwinia 
herbicola was first cloned in the BamHI site of plasmid vector pACYC184 (chloramphenicol 
resistant), and then a 1.1 kb BamHI fragment containing the E. herbicola β-carotene 
hydroxylase (CrtZ) was deleted. E. coli strains containing the resulting plasmid, pAC-BETA, 
accumulate β-carotene and form yellow colonies (Cunningham et al., 1994). 

A full length cDNA encoding IPP isomerase of Haematococcus pluvialis (HP04) was 
first excised with BamHI and KpnI from pBluescript SK-, and then ligated into the 
corresponding sites of the pTrcHisA vector with high-level expression from the trc promoter 
(Invitrogen, Inc.). A fragment containing the IPP isomerase and trc promoter was 
subsequently excised with EcoRV and KpnI, treated with the Klenow fragment of DNA 
polymerase to produce blunt ends, and ligated in the Klenow-treated HindIII site of pAC-BETA. 
E. coli cells transformed with this new plasmid pAC-BETA-04 form orange colonies 
on LB plates (vs. yellow for those containing pAC-BETA) and cultures accumulate 
substantially more β-carotene (ca. two fold) than those that contain pAC-BETA. 



Screening of an Arabidopsis cDNA Library 

Several λ cDNA expression libraries of Arabidopsis were obtained from the 
Arabidopsis Biological Resource Center (Ohio State University, Columbus, OH) (Kieber et 
al., 1993). The λ cDNA libraries were excised in vivo using Stratagene's ExAssist SOLR 
system to produce a phagemid cDNA library wherein each phagemid contained also a gene 
conferring resistance to the antibiotic ampicillin. 

E. coli strain DH10BZIP was chosen as the host cell for the screening and pigment 
production, although we have also used TOP10F' and XL1-Blue for this purpose. DH10B 
cells were transformed with plasmid pAC-BETA-04 and were plated on LB agar plates 
containing chloramphenicol at 50 µg/ml (from United States Biochemical Corporation). The 
phagemid Arabidopsis cDNA library was then introduced into DH10B cells already 
containing pAC-BETA-04. Transformed cells containing both pAC-BETA-04 and 
Arabidopsis cDNA library phagemids were selected on chloramphenicol plus ampicillin (150 
µg/ml) agar plates. Maximum color development occurred after 3 to 7 days incubation at 
room temperature, and the rare bright yellow colonies were selected from a background of 
many thousands of orange colonies on each agar plate. Selected colonies were inoculated 
into 3 ml liquid LB medium containing ampicillin and chloramphenicol, and cultures were 
incubated at room temperature for 1-2 days, with shaking. Cells were then harvested by 
centrifugation and extracted with acetone in microfuge tubes. After centrifugation, the 
pigmented extract was spotted onto silica gel thin-layer chromatography (TLC) plates, and 
developed with a hexane:ether (1:1, by volume) mobile phase. β-carotene hydroxylase-encoding 
cDNAs were identified based on the appearance of a yellow pigment that co-migrated 
with zeaxanthin on the TLC plates. 

Subcloning and Sequencing 

The plasmid containing the β-carotene hydroxylase cDNA was recovered and 
analyzed by standard procedures (Sambrook et al., 1989). The Arabidopsis β-carotene 
hydroxylase was sequenced completely on both strands on an automatic sequencer (Applied 
Biosystems, Model 373A, Version 2.0.1S). The cDNA insert of 0.95 kb also was excised 
and ligated into a pTrcHis vector. A BglII restriction site within the cDNA was used to 
remove that portion of the cDNA that encodes the predicted polypeptide N terminal sequence 
region that is not also found in bacterial β-carotene hydroxylases (Figure 6). A BglII-XhoI 
fragment was directionally cloned in BamHI-XhoI digested TrcHis vectors. 

Pigment Analysis 

A single colony was used to inoculate 50 ml of LB containing ampicillin and 
chloramphenicol in a 250-ml flask. Cultures were incubated at 28°C for 36 hours with gentle 
shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed once with 
distilled H2O and resuspended with 0.5 ml of water. The extraction procedures and HPLC 
were essentially as described previously (Cunningham et al., 1994). 

II. Isolation and biochemical analysis of an Arabidopsis lycopene ε-cyclase 
Plasmid Construction 

Construction of plasmids pAC-LYC, pAC-NEUR, and pAC-ZETA is described in 
Cunningham et al., (1994). In brief, the appropriate carotenoid biosynthetic genes from 
Erwinia herbicola, Rhodobacter capsulatus, and Synechococcus sp. strain PCC7942 were 
cloned in the plasmid vector pACYC184 (New England BioLabs, Beverly, MA). Cultures of 
E. coli containing the plasmids pAC-ZETA, pAC-NEUR, and pAC-LYC accumulate ζ-carotene, 
neurosporene, and lycopene, respectively. The plasmid pAC-BETA was 
constructed as follows: an 8.6-kb BglII fragment containing the carotenoid biosynthetic genes 
of E. herbicola (GenBank M87280; Hundle et al., 1991) was obtained after partial digestion 
of plasmid pPL376 (Perry et al., 1986; Tuveson et al., 1986) and cloned in the BamHI site of 
pACYC184 to give the plasmid pAC-EHER. Deletion of adjacent 0.8- and 1.1-kb BamHI-BamHI 
fragments (deletion Z in Cunningham et al., 1994), and of a 1.1 kb SalI-SalI fragment 
(deletion X) served to remove most of the coding regions for the E. herbicola β-carotene 
hydroxylase (crtZ gene) and zeaxanthin glucosyltransferase (crtX gene), respectively. The 
resulting plasmid, pAC-BETA, retains functional genes for geranylgeranyl pyrophosphate 
synthase (crtE), phytoene synthase (crtB), phytoene desaturase (crtI), and lycopene cyclase 
(crtY). Cells of E. coli containing this plasmid form yellow colonies and accumulate β-carotene. 
A plasmid containing both the lycopene ε- and β-cyclase cDNAs of A. thaliana 
was constructed by excising the ε-cyclase in clone y2 as a PvuI-PvuII fragment and ligating 
this piece in the SnaBI site of a plasmid (pSPORT 1 from GIBCO-BRL) that already 
contained the β-cyclase (Cunningham et al., 1996). 



Organisms and Growth Conditions 

E. coli strains TOP10 and TOP10 F' (obtained from Invitrogen Corporation, San 
Diego, CA) and XL1-Blue (Stratagene) were grown in Luria-Bertani (LB) medium 
(Sambrook et al., 1989) at 37°C in darkness on a platform shaker at 225 cycles per min. 
Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl). 
Ampicillin at 150 µg/mL and/or chloramphenicol at 50 µg/mL (both from United States 
Biochemical Corporation) were used, as appropriate, for selection and maintenance of 
plasmids. 

Mass Excision and Color Complementation Screening of an A. thaliana cDNA Library 

A size-fractionated 1-2 kb cDNA library of A. thaliana in lambda ZAPII (Kieber et 
al., 1993) was obtained from the Arabidopsis Biological Resource Center at The Ohio State 
University (stock number CD4-14). Other size fractionated libraries were also obtained 
(stock numbers CD4-13, CD4-15, and CD4-16). An aliquot of each library was treated to 
cause a mass excision of the cDNAs and thereby produce a phagemid library according to the 
instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1-Blue 
and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of E. coli 
TOP10 F' (this strain contained the plasmid pAC-LYC) by incubation of the phagemid with 
the E. coli cells for 15 min at 37°C. Cells had been grown overnight at 30°C in LB medium 
supplemented with 2% (w/v) maltose and 10 mM MgSO4 (final concentration), and harvested 
in 1.5 ml microfuge tubes at a setting of 3 on an Eppendorf microfuge (5415C) for 10 min. 
The pellets were resuspended in 10 mM MgSO4 to a volume equal to one-half that of the 
initial culture volume. Transformants were spread on large (150 mm diameter) LB agar petri 
plates containing antibiotics to provide for selection of cDNA clones (ampicillin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming units 
were spread on each plate. Petri plates were incubated at 37°C for 16 hr and then at room 
temperature for 2 to 7 days to allow maximum color development. Plates were screened 
visually with the aid of an illuminated 3x magnifier and a low power stage-dissecting 
microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be observed 
in the background of pink colonies. A colony color of yellow or pinkish-yellow was taken as 
presumptive evidence of a cyclization activity. These yellow colonies were collected with 
sterile toothpicks and used to inoculate 3 ml of LB medium in culture tubes with overnight 
growth at 37°C and shaking at 225 cycles/min. Cultures were split into two aliquots in 
microfuge tubes and harvested by centrifugation at a setting of 5 in an Eppendorf 5415C 
microfuge. After discarding the liquid, one pellet was frozen for later purification of plasmid 
DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was resuspended by 
vortex mixing, and extraction was allowed to proceed in the dark for 15-30 min with 
occasional remixing. Insoluble materials were pelleted by centrifugation at maximum speed 
for 10 min in a microfuge. Absorption spectra of the supernatant fluids were recorded from 
350-550 nm with a Perkin Elmer lambda six spectrophotometer. 

Analysis of isolated clones 

Eight of the yellow colonies contained β-carotene, indicating that a single gene 
product catalyzes both cyclizations required to form the two β endgroups of the symmetrical 
β-carotene from the symmetrical precursor lycopene. One of the yellow colonies contained a 
pigment with the spectrum characteristic of δ-carotene, a monocyclic carotenoid with a single 
ε endgroup. Unlike the β-cyclase, this ε-cyclase appears unable to carry out a second 
cyclization at the other end of the molecule. 

The observation that ε-cyclase is unable to form two cyclic ε-endgroups (e.g. the 
bicyclic ε-carotene) illuminates the mechanism by which plants can coordinate and control 
the flow of substrate into carotenoids derived from β-carotene versus those derived from α-carotene 
and also can prevent the formation of carotenoids with two ε endgroups. 

The availability of the A. thaliana gene encoding the ε-cyclase enables the directed 
manipulation of plant and algal species for modification of carotenoid content and 
composition. Through inactivation of the ε-cyclase, whether at the gene level by deletion of 
the gene or by insertional inactivation or by reduction of the amount of enzyme formed (such 
as by antisense technology), one may increase the formation of β-carotene and other 
pigments derived from it. Since vitamin A is derived only from carotenoids with β 
endgroups, an enhancement of the production of β-carotene versus α-carotene may enhance 
nutritional value of crop plants. Reduction of carotenoids with ε-endgroups may also be of 
value in modifying the color properties of crop plants and specific tissues of these plants. 
Alternatively, where production of α-carotene, or pigments such as lutein that are derived 
from α-carotene, is desirable, whether for the color properties, nutritional value or other 
reason, one may overexpress the ε-cyclase or express it in specific tissues. Wherever 
agronomic value of a crop is related to pigmentation provided by carotenoid pigments the 
directed manipulation of expression of the ε-cyclase gene and/or production of the enzyme 
may be of commercial value. 

The predicted amino acid sequence of the A. thaliana ε-cyclase enzyme was 
determined. A comparison of the amino acid sequences of the β- and ε-cyclase enzymes of 
Arabidopsis thaliana (Fig. 13) as predicted by the DNA sequence of the respective cDNAs 
(Fig. 4 for the ε-cyclase cDNA sequence), indicates that these two enzymes have many 
regions of sequence similarity, but they are only about 37% identical overall at the amino acid 
level. The degree of sequence identity at the DNA base level, only about 50%, is sufficiently 
low such that we and others have been unable to detect this gene by hybridization using the 
β-cyclase as a probe in DNA gel blot experiments. 

We claim: 

1. An isolated and/or purified nucleic acid sequence which encodes for a protein having 
lycopene ε-cyclase enzyme activity and has an amino acid sequence which is at least 85% 
identical to one of SEQ ID NOS: 23 or 25-27. 

2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid 
sequence of one of SEQ ID NOS: 23 or 25-27. 

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid 
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

5. The host cell of claim 4, wherein the host cell is selected from the group consisting of 
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene ε-cyclase enzyme activity and 
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23 or 
25-27. 

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS: 23 or 25-27. 

AMENDED CLAIMS 

[received by the International Bureau on 15 November 1999 (15.11.99); 
original claims 1, 2, 7 and 8 amended; remaining claims unchanged (1 page)] 

1. An isolated and/or purified nucleic acid sequence which encodes for a protein having 
lycopene ε-cyclase enzyme activity and has an amino acid sequence which is at least 85% 
identical to one of SEQ ID NOS: 23, 25 or 26. 

2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid 
sequence of one of SEQ ID NOS: 23, 25 or 26. 

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid 
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

5. The host cell of claim 4, wherein the host cell is selected from the group consisting of 
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene ε-cyclase enzyme activity and 
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23, 25 
or 26. 

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS: 23, 25 or 26. 

FIG. 4A 

Arabidopsis thaliana epsilon cyclase: 

acaaaaggaaataattag attcctctttctgcttgctataccttgaca 48 

gaacaacataacaatggtgtaagtcttctc gctgtattcgaaattatttggaggaggaac 108 

atggagtgtgttggggctaggaatttcgca gcaatggcggtttcaacatttccgtcatgg 168 
1 M E C VTA KN F A A M A V S T F P S W 

agttgtcgaaggaaatttccagtggctaag agatacagctataggaatattcgcttcqgt 228 
21 SCR^KFPVVK RYSYRNlKFb 

^ ttgtgta^tgtcagagcta^cggcggcgga agttccggta^tgaga^tt^tgtagcggtg 288 

agagaagatttcgctgacgaagaagatttt gcgaaagctggcggttctgagattctattt 348 
61 RSDFADEEDF V E S R I L F 

gttcaaatgcagcagaacaaagatatggat gaacagtctaagcttgttgataagttgcct 408 
81 VQMQQMKDMD SQSKLVDKLP 

cctatatcaactggtgatggtgctttggat catgtggttactggctgtggtcctgctggt 468 
101 P I S ila Dla A L D K TV 1^ P A^ 

ttagccttggctgcagaatcagctaagctt ggattaaaagttggactcattggtccagat 528 
121 LALAAKSAKL GLKVGLIGPD 

cttccttttactaacaattacggtgtttgg gaagatgaattcaatgatcttgggctgcaa 588 
141 LPFTMMYGVM KDKFNDLGLG 

aaatgtattgagcatgtttggagagagact attgcgcacctggatgatgacaagcctatt 648 
161 KCIKKVWRST IVYLDDDKPI 

accattggccgtgcttatggaagagttagt cgacgtttgctccatgaggagcttttgagg 708 
181 T lb R A Y G R V S RRLLXEELLR^ 

aggtgtgtcgagtcaggtgtctcgtacctt agctcgaaagttgacagcataacagaagct 768 
201 fCVKS^VSYL SSKVDSITEA 



tgtgatggccttagacttgttgcttgtgac gacaataacgtcattccctgcaggcttgcc 828 
SDTaLXLVACD DMMVIPCXLA 

actgttgcttctggagcagcttcgggaaag ctcttgcaatacgaagttggtggacctaga 888 
241 TVASbAASGK LLQYXVGbPR 

gtctgtgcgcaaactgcatacggcgtggag gttgaggcggaaaatagtccatatgatcca 948 
FIG. 4B 

261 VCVQTAYGVX VXVXNSPYDP 

qatcaaatggttttcatggattacagagat tatactaacgagaaagttcqgagcttagaa 1006 
281 D Q M V P Ml Y R D YTMXXVRSLX 

qctqaqtatccaacgtttctgtacgccatg cctatgacaaagtcaagactcttcttcgag 1068 
301 ^TY P T F L Y A M PMTKSRLFFK 

qaqacatgtttggcctcaaaagatgtcatg ccctttgatttgctaaaaacgaagctcatg 1128 
321 XT C lA SKDVM P FDLLKTKLM 

ttaapattagacacartcgpiattc^aatt cteaagarttacgaagaggagtggtcctat 1188 

atcccagttggtggttccttgccaaacacc ^acaaaagaatctcgcctttggtgctgcc 1248 

qctaqcatggtacatcccgcaacaggctat tcagttgtgagatctttgtctgaagctcca 1308 
381 ^ SHI M PTT G Y SVVRSLSXAP 

aaacatgcatcagtcatcg^gagatacta agagaag|gactaccaaacagattaaca^t 1368 

aatatttcaagacaagcttaggatacttta tggccaccagaaaqgaaaagacagaqagca 1428 
421 M I S R Q A WD T L WPPERXRQRA 

ttctttctctttgqtcttgcactca^agtt caattcgataccga^gg^ttagaa^cttc 1488 

ttccgtactttcttccgccttccaaaatgg atgtggcaagqgtttctaggatcaacatta 1548 
461 FRTPFRLPKW MWQGFLGSTL 

acatcaggagatctcgttctctttgcttta tacatgttcgtcatttcaccaaacaatttg 1608 
481 T sXd LVLFAL YMPVISPMML 

agaaaaggtctcattaatcatctcatctct gatccaaccgqagcaaccatgataaaaacc 1668 
501 Hi LINWLIS DPtThMIKT 

tatctcaaagtatgatttacttaccaactc ttaggtttgtgtatatatatgccgatttat 1728 
521 Y L K V 

ctgaaiaatcgatcaaagaatggtatgtgg gttactaggaagttggaaacaaacacgtat 1788 

agaatctaaggagtgatcgaaatggagacg gaaacgaaaagaaaaaaatcagtctttgtt 1848 

ccgtggctagtg 1868 
FIG. 5 

accgatttcc gactccgcct cccgaaatcc 
tcttcgattt aaacgctttt ctgtctgtta 
agaattctcc gattgagaac gatgagagac 
aacgctatag acgctgagta tctggcgttg 
gaggaagaaa tcggagaggt ccacttatct 
gctttggtat cacttctatg gctgttatgg 
tggcaaatgg agggaggtga gatctcaatg 
tgctctctct gttggtgctg ctgttggtat 
ctcatagagc tctgtggcac gcttctctat 
cacaaaccaa gagaaggacc gtttgagcta 
gaacgctggt ccagcgattg gtctcctctc 
gactcgttcc tggtctctgc tttggcgccg 
ggaatcgcct acatgtttgt ccacgatggt 
tgtaggtccc atcgccgacg tcccttacct 
accagctaca tcacacagac aagttcaatg 
cttggaccca aggaattgga agaagttgga 
ggagattagt cggagaatca aatcatacaa 
cgagttcgag ttcttgactt taaacaagtt 
ttgtcttctg tcattatgat catcttaaga 
FIG. 7 
FIG. 8 

1 ccctgtc tgtttcttet tt.ttta.tc teccttg.t tcgcctc.g. 
51 tetctcgctc tttegtcttc tttttcttct ttccGATTTC CCCATCCTCC 
101 TCTGTCATCC ATTTCACCGA GAAAGTTACC GAATTTTCGT GCTTTCTCTG 
151 GTACCGCTAT GACAGATACT AAAGATGCTG GTATGGATGC TGTTCAGAGA 
201 CGTCTCATGT TTGAGGATGA ATGCATTCTT GTTGATGAAA CTGATCGTGT 
251 TGTGGGGCAT GTCAGCAAGT ATAATTGTCA TCTGATGGAA AATATTGAAG 
301 CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTCGAAG 
351 TATGAGTTGC TTCTCCAGCA AAGGTCAAAC ACAAAGGTTA CGTTCCCTCT 
401 AGTGTGGACT AACACTTGTT GCAGCCATCC TCTTTACCGT GAATCAGAGC 
451 TTATCCAGGA CAATGCACTA GGTGTGAGGA ATGCTGCACA AAGAAAGCTT 
501 CTCGATGAGC TTGGTATTGT AGCTGAAGAT GTACCAGTCG ATGAGTTCAC 
551 TCCCTTGGGA CCTATGCTGT ACAAGGCTCC TTCTGATGGC AAATGGGGAG 
601 AGCATGAACT TGATTACTTG CTCTTCATCG TGCGAGACGT GAAGGTTCAA 
651 CCAAACCCAG ATGAAGTAGC TGAGATCAAG TATGTGAGCC GGGAAGAGCT 
701 GAAGGAGCTG GTGAAGAAAG CAGATGCAGG TGAGGAAGGT TTGAAACTGT 
751 CACCATGGTT CAGATTGGTG GTGGACAATT TCTTGATGAA GTGGTGGGAT 
801 CATGTTGAGA AAGGAACTTT GGTTGAAGCT ATAGACATGA AAACCATCCA 
851 CAAACTCTGA ACATCTTTTT TTAAAGTTTT TAAATCAATC AACTTTCTCT 
901 TCATCATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT GTGAGACACT 
951 TACAAAACTT CCAAGCACCT GAGGCAATAA TAAAGTTTGC GGCCGC 
FIG. 9 

1 CTCGGTACCT CGCCACAATC GCTATTTGGA ACCTGGCCCG GCGG.CAGTCC 

51 GATGCCGCGA TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCCCG 

101 CGTGAACTCC GCCCAGCAGC CCAGCTGTGC ACACGCGCGA CTCCAGTTTA 

151 AGCTCAGGAG CATGCAGATG ACGCTCATGC AGCCCAGCAT CTCAGCCAAT 

201 CTGTCGCGCG CCCAGGACCG CACAGACCAC ATGAGGGGTG CAAGCACCTG 

251 GGCXGCCGCC CAGTCCCAGG ATGAGCTGAT GCTGAAGGAC GAGTGCATCT 

301 TGGTGGATGT TGAGGACAAC ATCACAGGCC ATGCCAGCAA GCTCGAGTGT 

351 CACAAGTTCC TACCACATCA GCCTGCAGGC CTGCTGCACC GGGCCTTCTC 

401 TGTGTTCCTG TTTGACGATC AGGGGCGACT GCTGCTGCAA CAGCGTGCAC 

451 GCTCAAAAAT CACCTTCCCA AGTGTGTGGA CGAACACCTG CTGCAGCCAC 

501 CCTTTACATG GGCAGACCCC AGATGAGGTG GACCAACTAA GCCAGGTGGC 

551 CGACGGAACA GTACCTGCCG CAAAGGCTGC TGCCATCCGC AAGTTGGAGC 

601 ACGAGCTGGG GATACCAGCG CACCAGCTGC CGGCAAGCGC GTTTCGCTTC 

651 CTGACGCGTT TGCACTACTG TGCCGCGGAC GTGCAGCCAG CTGCGACACA 

701 ATCAGCGCTC TGGGGCGAGC ACGAAATGGA CTACATCTTG TTCATCCGGG 

751 CCAACGTCAC CTTGGCGCCC AACCCTGACG AGGTGGACGA AGTCAGGTAC 

801 GTGACGCAAG AGGAGCTGCG GCAGATGATG CAGCCGGACA ACGGGCTGCA 

851 ATGGTCGCCG TGGTTTCGCA TCATCGCCGC GCGCTTCCTT GAGCGTTGGT 

901 GGGCTGACCT GGACGCGGCC CTAAACACTG ACAAACACGA GGATTGGGGA 

951 ACGGTGCATC ACATCAACGA AGCGTGAAAG CAGAAGCTGC AGGATGTGAA 

1001 GACACGTCAT GGGGTGGAAT TGCGTACTTG GCAGCTTCGT ATCTCCTTTT 

1051 TCTGAGACTG AACCTGCAGT CAGGTCCCAC AAGGTCAGGT AAAATGGCTC 

1101 GATAAAATGT ACCGTCACTT TTTGTCGCGT ATACTGAACT CCAAGAGGTC 

1151 AAAAAAAAAA AAAAA 
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CACAATCAGC 
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751- 
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GTACGTGACG 


301 


GACAACGGGC 


TTCAATGGTC 
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CCTTGAGCGT 
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901 
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GGGAACGGTG 
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1051 


CATCATATTC 


ATCGTCTCTC 
FIG. 10 

GCTATTTGGA ACCTGGCCCG CCGGCAGTCC 
GTTGCTCAGA GGCCTCACGC ATATCCCGCG 
CCAGCTGTGC ACACGCGCGA CTCCAGTTTA 
CTTTCCGAGG ACCGCACAGA CCACATGAGG 
CGGGCAGTCG CAGGATGAGC TGATGCTGAA 
ATGTTGAGGA CAACATCACA GGCCATCCCA 
TTCCTACCAC ATCAGCCTGC AGGCCTGCTG 
CCTGTTTGAC GATCAGGGGC GACTGCTGCT 
AAATCACCTT CCCAAGTGTG TGGACGAACA 
CATGGGCAGA CCCCAGATGA GGTGGACCAA 
AACAGTACCT GGCGCAAAGG CTGCTGCCAT 
TGGGGATACC AGCGCACCAG CTGCCGGCAA 
CGTTTGCACT ACTGTGCCGC GGACGTGCAG 
GCTCTGGGGC GAGCACGAAA TGGACTACAT 
TCACCTTGGC GCCCAACCCT GACGAGGTGG 
CAAGAGGAGC TGCGGCAGAT GATGCAGCCG 
GCCGTGGTTT CGCATCATCG CCGGGCGCTT 
ACCTGGACGC GGCCCTAAAC ACTGACAAAC 
CATCACATCA ACGAAGCGTG AAGGCAGAAG 
TCATGGGGTG GAATTGCGTA CTTGGCAGCT 
ACTGAACCTG CAGAGCTAGA GTCAATGGTG 
TTTTGTTTTA GACTAATCTG TAGCTAGAGT 
CTTTCAAAAA AAAAA 
FIG. 11A 



1 



HP04 MLRSLLRGLT HIPRVNSAQQ 
HP05 MLRSLLRGLT HIPRVNSAQQ 
ATDP7 MSVSSLFNLP .LIRLRSLA. 
C.brew. MS.SSMLNFT .ASRIVSLPL 

AT0P5 TGPPPRFFP 

S.cerev. . .MTADNNSM PHGAVSSYAK 



51 

AEDRTDHMRG 
SEDRTDHMRG 
S..GTA.MTD 
SSQATT.MGE 

T.MTD 

SSETSNDESG 

101 

LPHQPAGLLH 
LPHQPAGLLH 
ENIEAKNLLH 
ENIESENLLH 
EKIEAENLLH 
ENIE.KGLLH 



ASTWAGGQSQ 
ASTWAGGQSQ 
TKDAGMDAVQ 
VVDAGMDAVQ 
SNDAGMDAVQ 
ETCFSGHDEE 



RAFSVFLFDD 
RAFSVFLFDD 
RAFSVFLFNS 
RAFSVFLFNS 
RAFSVFLFNS 
RAFSVFIFNE 



PSCAHARLQF 
PSCAHARLQF 
LSSSFSSFRF 
LSSPPSRVHL 
IRSPVPRTQL 
LVQNQTPEDI 



DELMLKDECI 
DELMLKDECI 
RRLMFEDECI 
RRLMFEDECI 
RRLMFEDECI 
QIKLMNENCI 



50 

KLRSMQMTLM QPSISANLSR 

KLRSMQLL. . 

AHRPLSSIS. PRKLPNFRAF 
PLCFFSPISL TQRFSAKLTF 

FVRAFSAV 

LEEFPEIIPL QQRPN...TR 



LVDVEDNITG 
LVDVEDNITG 
LVDETDRVVG 
LVDENDKVVG 
LVDENNRVVG 
VLDWDDNAIG 



QGRLLLQQRA 
QGRLLLQQRA 
KYELLLQQRS 
KYELLLQQRS 
KYELLLQQRS 
QGELLLQQRA 



151 

GQTPDEVDQL SQVADGTVPG 
GQTPDEVDQL SQVADGTVPG 

RE SELIQDNALG 

RE SELIDENCLG 

RE SELIEENVLG 

ID...DELGL KGKLDDKIKG 

201 

RLHYCAADVQ PAATQSALWG 
RLHYCAADVQ PAATQSALWG 

RMLY KAPSDGKWG 

RILY KAPSDGKWG 

RMLY KAPSDGKWG 

RIHY MAPSNEPWG 



AKAAAIRKLE 
AKAAAIRKLE 
VRNAAQRKLL 
VRNAAQRKLL 
VRNAAQRKLF 
AITAAVRKLD 



RSKITFPSVW 
RSKITFPSVW 
NTKVTFPLVW 
ATKVTFPLVW 
KTKVTFPLVW 
TEKITFPDLW 



HELGIPAHQL 
HELGIPAHQL 
DELGIVAEDV 
DELGIPAEDL 
DELGIVAEDV 
HELGIPEDET 



100 

HASKLECHKF 
HASKLECHKF 
HVSKYNCHLM 
HESKYNCHLM 
HDTKYNCHLM 
AGTKKVCHLM 

150 

TNTCCSHPLH 
TNTCCSHPLH 
TNTCCSHPLY 
TNTCCSHPLY 
TNTCCSHPLY 
TNTCCSHPLC 

200 

PA.SAFRFLT 
PA.SAFRFLT 
PV.DEFTPLG 
PV.DQFIPLS 
PV.DEFTPLG 
KTRGKFHFLN 



EHEMDYILFI 
EHEMDYILFI 
EHELDYLLFI 
EHELDYLLFI 
EHEVDYLLFI 
EHEIDYILFY 



250 

.... RANVTL APNPDEVDEV 
. . . . RANVTL APNPDEVDEV 
....VRDVKV QPNPDEVAEI 

IRDVNL DPNPDEVAEV 

VRDVKL QPNPDEVAEI 

KINAKENLTV NPNVNEVRDF 
FIG. 11B 

251 300 
RYVTQEELRQ MMQ....PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD 

RYVTQEELRQ MMQ....PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD 

KYVSREELKE LVKKADAGEE GLKLSPWFRL VVDNFLMKWW DHVEKGTLVE 
KYMNRDDLKE LLRKADAEEE GVKLSPWFRL VVDNFLFKWW DHVEKGSLKD 
KYVSREELKE LVKKADAGDE AVKLSPWFRL VVDNFLMKWW DHVEKGTITE 
KWVSPNDLKT MF ADP SYKFTPWFKI ICENYLFNWW EQLDDLSEVE 

301 

KHEDWGTVHH INEA* 
KHEDWGTVHH INEA* 
A.IDMKTIHK L* 
A.ADMKTIHK L* 
A.ADMKTIHK L* 
A.ADMKTIHK L* 
NDRQ...IHR ML* 
FIG. 12 
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ctcaaatcnc 


ctccgtcgct 


cttactccgc 


catgggrrgac 
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gactccggca 


tggatgctgt 


tcagcgacgt 
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gggacatgat 
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gatggagaag 


attgaaacag 


gtaaaatgct 


gcacagagca 


201 


ttcagcgttt 


tt:ct:att:caa 


ttcaaaatac 


gagntacttc 


tticagcaacg 


251 


guctgcaacc 


aaggngacat 


ttcctttagH 


atggaccaac 


acctgttgca 


301 


gccatccact 


ctacagagaa 


tccgagcHtg 


txcccgaaac 


gcctgagaga 


351 


atgctgcaca 


gaggaxxxxx 


xxxxxxxxxx 


xxxxxxxxxx 


xxxxxxxxxx 
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xxxxxxxxxx 


xxxxxxxxxx 


xxxxxxxxxx 


xxxxxxxxxx 


xxxxxxxxxx 
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xxxxxxxxxx 


xxxxxxxxxx 
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xxxxxxxxxx 
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xxxxxxxxxx 
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atatagaaac 


751 


acaccctcaa 


ccgaaaagca 


agcctaataa 


ttcgggHtgg 


grcgggtrcca 


801 


ccatcaa.ttg frttttttctt: 


ttiaacaactt 


ttaatcticta 


tttgagcatg 


851 


ttgattcttg 


tctrfctcgtgn 


graagatttt 


gggttt:cgl;t: 


tcagrrgtaa 


901 


taatgaacca 


ttgatggttt 


gcaatttcaa 


gttcctatcg 


acatgnagtg 


951 


atctaaaaaa 
FIG. 14A 

Adonis palaestina ε-cyclase cDNA #5 Length: 1898 

1 aaaggagtgt tctattaatg ttactgtcgc attcttgcaa cacttatatt 
51 caaactccat tttcttcttt tctcttcaaa acaacaaact aatgtgagca 
101 gagtatctgg ctatggaact acttggtgtt cgcaacctca tctcttcttg 
151 ccctgtgtgg acttttggaa caagaaacct tagtagttca aaactagctt 
201 ataacataca tcgatatggt tcttcttgta gagtagattt tcaagtgaga 
251 gctgatggtg gaagcgggag tagaagttct gttgcttata aagagggttt 
301 tgtggatgaa gaggatttta tcaaagctgg tggttctgag cttttgtttg 
351 tccaaatgca gcaaacaaag tctatggaga aacaggccaa gctcgccgat 
401 aagttgccac caataccttt tggagaatcc gtgatggact tggttgtaat 
451 aggttgtgga cctgctggtc tttcactggc tgcagaagct gctaagctag 
501 ggttgaaagt tggccttatt ggtcctgatc ttccttttac aaataattat 
551 ggtgtgtggg aagacgagtt caaagatctt ggacttgaac gttgtatcga 
601 gcatgcttgg aaggacacca tcgtatatct tgataatgat gctcctgtcc 
651 ttattggtcg tgcatatgga cgagttagtc gacatttgct acatgaggag 
701 ttgctgaaaa ggtgtgtgga gtcaggtgta tcatatctgg attctaaagt 
751 ggaaaggatc actgaagctg gtgatggcca tagccttgta gtttgtgaaa 
801 atgagatctt tatcccttgc aggcttgcta ctgttgcatc tggagcagct 
851 tcagggaaac ttttggagta tgaagtaggt ggccctcgtg tttgtgtcca 
901 aaccgcttat ggggtggagg ttgaggtgga gaacaatcca tacgatccca 
951 acttaatggt attcatggac tacagagact atatgcaaca gaaattacag 
1001 tgctcggaag aagaatatcc aacatttctC tatgtcatgc ccatgtcgcc 
1051 aacaagactt ttttttgagg aaacctgttt ggcctcaaaa gatgccatgc 
1101 cattcgatct actgaagaga aaactgatgt cacgattgaa gactctgggt 
1151 atccaagtta caaaagttta tgaagaggaa tggtcatata ttcctgttgg 
1201 tggttcttta ccaaacacag agcaaaagaa cctagcattt ggtgctgcag 
1251 caagcatggt gcatccagca acaggctatt cggttgtacg gtcactgtca 
1301 gaagctccaa aatatgcttc tgtaattgca aagattttga agcaagataa 
1351 ctctgcgtat gtggtttctg gacaaagtag tgcagtaaac atttcaatgc 
1401 ffgcatggag cagtctttgg ccaaaggagc gaaaacgtca aagagcatTc 
1451 tttcttttTg gattagagct tattgtgcag ctagatattg aagcaaccag 
1501 aacattcttt agaaccttct tccgcttgcc aacttggatg tggtggggtt 
1551 tccttgggtc ttcactatca tctttcgatc tcgtcttgtt ttccatgtac 
1601 atgtttgttt tggcgccaaa cagcatgagg atgtcacttg tgagacattt 
1651 gctttcagat ccttctggtg cagttatggt aagagcttac ctcgaaaggt 
1701 agtctcatct attattaaac tctagtgttt caccaaataa atgaggatcc 
1751 £ tc SMtgtg tatatgatca tctctatgta tatcctgtac tctaatctca 
1801 taaagtaaat gccgggtttg atattgttgt gtcaaaccgg ccaatgatat 
1851 aaagtaaatt tattgataca aaagtagttt ttttccttaa aaaaaaaa 
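
The stated length of a numbered listing can be cross-checked against its final row. The sketch below is illustrative only; the helper names `parse_row` and `row_end_position` are hypothetical, and the row format (a 1-based start position followed by blocks of ten bases) is assumed from the layout used in these figures.

```python
from typing import Tuple


def parse_row(line: str) -> Tuple[int, str]:
    """Split a numbered listing row into (start position, bases without spaces)."""
    fields = line.split()
    start = int(fields[0])          # 1-based position of the first base in the row
    bases = "".join(fields[1:])     # remaining fields are blocks of bases
    return start, bases


def row_end_position(line: str) -> int:
    """1-based position of the last base in a numbered row."""
    start, bases = parse_row(line)
    return start + len(bases) - 1


# Final row of the Adonis palaestina cDNA #5 listing, copied from the figure.
LAST_ROW = "1851 aaagtaaatt tattgataca aaagtagttt ttttccttaa aaaaaaaa"
print(row_end_position(LAST_ROW))   # compare with the header value "Length: 1898"
```

The same check applies to any listing in this format, provided the final row is intact and its position label is legible.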
FIG. 14B 

Adonis palaestina ε-cyclase #5 predicted polypeptide 
TRANSLATE from: 113 to: 1702 Length: 529 amino acids 

1 MELLGVRNLI SSCPVWTFGT RNLSSSKLAY NIHRYGSSCR VDFQVRADGG 

51 SGSRSSVAYK EGFVDEEDFI KAGGSELLFV QMQQTKSMEK QAKLADKLPP 

101 IPFGESVMDL VVIGCGPAGL SLAAEAAKLG LKVGLIGPDL PFTNNYGVWE 

151 DEFKDLGLER CIEHAWKDTI VYLDNDAPVL IGRAYGRVSR HLLHEELLKR 

201 CVESGVSYLD SKVERITEAG DGHSLVVCEN EIFIPCRLAT VASGAASGKL 

251 LEYEVGGPRV CVQTAYGVEV EVENNPYDPN LMVFMDYRDY MQQKLQCSEE 

301 EYPTFLYVMP MSPTRLFFEE TCLASKDAMP FDLLKRKLMS RLKTLGIQVT 

351 KVYEEEWSYI PVGGSLPNTE QKNLAFGAAA SMVHPATGYS VVRSLSEAPK 

401 YASVIAKILK QDNSAYVVSG QSSAVNISMQ AWSSLWPKER KRQRAFFLFG 

451 LELIVQLDIE ATRTFFRTFF RLPTWMWWGF LGSSLSSFDL VLFSMYMFVL 

501 APNSMRMSLV RHLLSDPSGA VMVRAYLER* 
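
The header line "TRANSLATE from: 113 to: 1702 Length: 529 amino acids" can be checked arithmetically. The following sketch assumes the coordinates are 1-based and inclusive and that the final codon in the range is the stop codon; the function name `codon_count` is hypothetical. The two sequence rows are copied from the FIG. 14A listing (rows 1651 and the start of row 1701).

```python
def codon_count(start: int, end: int) -> int:
    """Number of codons in a 1-based, inclusive coordinate range."""
    span = end - start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return span // 3


codons = codon_count(113, 1702)   # 1590 bases, i.e. 530 codons
residues = codons - 1             # one codon is the stop, leaving 529 residues

# Confirm that positions 1700-1702 spell a stop codon.
ROW_1651 = "gctttcagat ccttctggtg cagttatggt aagagcttac ctcgaaaggt"
ROW_1701_START = "agtctcatct"
window = (ROW_1651 + ROW_1701_START).replace(" ", "")   # window[0] is position 1651
last_codon = window[1700 - 1651 : 1702 - 1651 + 1]

print(codons, residues, last_codon)
```

For the potato listing the corresponding header gives "from: 14 to: 1147", which is 378 codons with the stop codon falling just outside the stated range, so the two headers evidently do not follow the same convention for the stop codon.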
FIG. 15A 

DNA sequence of potato cDNA (GenBank R27545) obtained from Nicholas J. 
Provart 

potato.seq Length: 1378 August 2, 1996 13:06 Type: N Check: 605 

1 tagcggnnnn naggatgagt tcaaagatct tggtcttcaa gcctgcattg 

51 aacatgtttg gcgggatacc attgtatatc ttgatgatga tgatcctatt 

101 cttattggcc gtgcctatgg aagagttagt cgccatttac tgcacgagga 

151 gttactcaaa aggtgtgtgg aggcaggtgt tttgtatcta aactcgaaag 

201 tggataggat tgttgaggcc acaaatggcc acagtcttgt agagtgcgag 

251 ggtgatgttg tgattccctg caggtttgtg actgttgcat cgggagcagc 

301 ctcggggaaa ttcttgcagt atgagttggg aggtcctaga gtttctgttc 

351 aaacagctta tggagtggaa gttgaggtcg ataacaatcc atttgacccg 

401 agcctgatgg ttttcatgga ttatagagac tatgtcagac acgacgctca 

451 atctttagaa gctaaatatc caacatttct ctatgccatg cccatgtctc 

501 caacacgagt ctttttcgag gaaacttgtt tggcttcaaa agatgcaatg 

551 ccattcgatc tgttaaagaa aaaattgatg ttacgattga acaccctcgg 

601 tgtaagaatt aaagaaattt atgaggagga atggtcttac ataccagttg 

651 gaggatcttt gccaaataca gaacaaaaaa cacttgcatt tggtgctgct 

701 gctagcatgg ttcatccagc cacaggttat tcagtcgtca gatcactgtc 

751 tgaagctcca aaatgcgcct tcgtgcttgc aaatatatta cgacaaaatc 

801 atagcaagaa tatgcttact agttcaagta ccccgagtat ttcaactcaa 

851 gcttggaaca ctctttggcc acaagaacga aaacgacaaa gatcgttttt 

901 cctatttgga ctggctctga tattgcagct ggatattgag gggataaggt 

951 catttttccg cgcgttcttc cgtgtgccaa aatggatgtg gcagggattt 

1001 cttggttcaa gtctttcttn agcagacctc atgttatttg ccttctacat 

1051 gtttattatt gcaccaaatg acatgagaag aggcttaatc agacatcttt 

1101 tatctgatcc tactggtgca acattgataa gaacttatct tacattttag 

1151 agtaaattcc tcctacaata gttgttgaan nagaggcctc attacttcag 

1201 attcataaca gaaatcgcgg tctctcgagg ccttgtatat aacattttca 

1251 ctaggttaat attgcttgaa taagttgcac agtttcagtt tttgtatctg 

1301 cttctttttt gtccaagatc atgtattgan ccaatttata tacattgcca 

1351 gtatatataa attttataaa aaaaaaaa 

poteps.pep Length: 378 TRANSLATE from: 14 to: 1147 

1 DEFKDLGLQA CIEHVWRDTI VYLDDDDPIL IGRAYGRVSR HLLHEELLKR 

51 CVEAGVLYLN SKVDRIVEAT NGHSLVECEG DVVIPCRFVT VASGAASGKF 

101 LQYELGGPRV SVQTAYGVEV EVDNNPFDPS LMVFMDYRDY VRHDAQSLEA 

151 KYPTFLYAMP MSPTRVFFEE TCLASKDAMP FDLLKKKLML RLNTLGVRIK 

201 EIYEEEWSYI PVGGSLPNTE QKTLAFGAAA SMVHPATGYS VVRSLSEAPK 

251 CAFVLANILR QNHSKNMLTS SSTPSISTQA WNTLWPQERK RQRSFFLFGL 

301 ALILQLDIEG IRSFFRAFFR VPKWMWQGFL GSSLSXADLM LFAFYMFIIA 

351 PNDMRRGLIR HLLSDPTGAT LIRTYLTF* 
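
The relationship between the potato cDNA listing and the poteps.pep translation can be reproduced for the first few codons. This is a minimal sketch using the standard genetic code; the function name `translate` and the use of "X" for codons containing an ambiguous base are assumptions, and only the first listing row is used.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, start: int, end: int) -> str:
    """Translate the 1-based inclusive range [start, end] of a DNA string."""
    region = dna.upper()[start - 1 : end]
    if len(region) % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    # Codons with ambiguous bases (e.g. 'N') are reported as 'X'.
    return "".join(
        CODON_TABLE.get(region[i : i + 3], "X") for i in range(0, len(region), 3)
    )


# Row 1 of potato.seq, copied from the listing; translation starts at base 14.
ROW_1 = "tagcggnnnn naggatgagt tcaaagatct tggtcttcaa gcctgcattg"
dna = ROW_1.replace(" ", "")
peptide = translate(dna, 14, 49)
print(peptide)   # compare with the start of poteps.pep: DEFKDLGLQA CI...
```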
FIG. 15B 

Chimeric lettuce/potato lycopene ε-cyclase: converts lycopene to δ- 
carotene; the lettuce cDNA converts lycopene to ε-carotene and the 
potato cDNA does not produce an active enzyme 

(amino acids in lower case are from lettuce and those in uppercase 
are from the potato cDNA; an AvaII site in common to the two cDNAs 
was used to construct the chimera) 

1 mecfgarnmt atmavftcpt ftdcnirhkf sllkqrrftn lsassslrqi 

51 kcsaksdrcv vdkqgisvac eedyvkaggs elffvqmqrt ksmesqskls 

101 eklaqipign cildlvvigc gpaglalaae saklglnvgl igpdlpftnn 

151 ygvwqdefig lglegciehs wkdtlvyldd adpirigray grvhrdllhe 

201 ellrrcvesg vsylsskver iteapngysl iecegnitip crlatvasga 

251 asgkfleyel gGPRVSVQTA YGVEVEVDNN PFDPSLMVFM DYRDYVRHDA 

301 QSLEAKYPTF LYAMPMSPTR VFFEETCLAS KDAMPFDLLK KKLMLRLNTL 

351 GVRIKEIYEE EWSYIPVGGS LPNTEQKTLA FGAAASMVHP ATGYSVVRSL 

401 SEAPKCAFVL ANILRQNHSK NMLTSSSTPS ISTQAWNTLW PQERKRQRSF 

451 FLFGLALILQ LDIEGIRSFF RAFFRVPKWM WQGFLGSSLS XADLMLFAFY 

501 MFIIAPNDMR RGLIRHLLSD PTGATLIRTY LTF* 
FIG. 16 

GAP comparison of Arabidopsis ε-cyclase x potato ε-cyclase (partial) 

blosum62.cmp Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1485 Length: 529 

Ratio: 3.929 Gaps: 1 

Percent Similarity: 79.893 Percent Identity: 76.139 
Match display thresholds for the alignment(s) : 
| = IDENTITY : = 2 . = 1 

151 EDEFNDLGLQKCIEHVWRETIVYLDDDKPITIGRAYGRVSRRLLHEELLR 200 

1 .DEFKDLGLQACIEHVWRDTIVYLDDDDPILIGRAYGRVSRHLLHEELLK 49 

201 RCVESGVSYLSSKVDSITEASDGLRLVACDDNNVIPCRLATVASGAASGK 250 

||||. II IMIII I II.. I II I: • Mill IIIIMIIII nn 
50 RCVEAGVLYLNSKVDRIVEATNGHSLVECEGDVVIPCRFVTVASGAASGK 99 

251 LLQYEVGGPRVCVQTAYGVEVEVENSPYDPDQMVFMDYRDYTNEKVRSLE 300 

HIM Mill 1 1 1 I I I ! I 1 1 1 : M : 1 1 IMIMIII -III 
100 FLQYELGGPRVSVQTAYGVEVEVDNNPFDPSLMVFMDYRDYVRHDAQSLE 149 

301 AEYPTFLYAMPMTKSRLFFEETCLASKDVMPFDLLKTKLMLRLDTLGIRI 350 

1 . 1 1 1 1 1 1 1 1 1 1 * -1.11111111111 MIMM 1 1 1 1 1 1 • 1 1 1 : 1 1 no 

150 AKYPTFLYAMPMSPTRVFFEETCLASKDAMPFDLLKKKLMLRLNTLGVRI 199 

351 LKTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSLSEAP 400 

. 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 1 1 IIIMIMIIIIIIIIMIIMIIII _ 
200 KEIYEEEWSYIPVGGSLPNTEQKTLAFGAAASMVHPATGYSVVRSLSEAP 249 

401 KYASVIAEILREETTKQI.....NSNISRQAWDTLWPPERKRQRAFFLFG 445 

I | Ml MM -I • -II Ml- MM IMMMIIII 

250 KCAFVLANILRQNHSKNMLTSSSTPSISTQAWNTLWPQERKRQRSFFLFG 299 

446 LALIVQFDTEGIRSFFRTFFRLPKWMWQGFLGSTLTSGDLVLFALYMFVI 495 

MM. I I IIIIMM lll.lllllllllll.l. II-IM MM _ 

300 LALILQLDIEGIRSFFRAFFRVPKWMWQGFLGSSLSXADLMLFAFYMFII 349 

496 SPNNLRKGLINHLISDPTGATMIKTYLKV 524 

.||.:|:MI l|:||||llh|:|ll „ e 
350 APNDMRRGLIRHLLSDPTGATLIRTYLTF 378 
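
The summary values in the GAP header can be checked against one another. In the sketch below, Ratio is taken to be Quality divided by the length of the shorter sequence, and the percentages are taken over aligned positions without gaps; both conventions are assumptions that happen to reproduce the printed values. The counts of 284 identical and 298 similar pairs over 373 aligned positions are inferred from the printed percentages and are not stated in the figure.

```python
QUALITY = 1485
SHORTER_LENGTH = 378      # residues in the partial potato polypeptide (poteps.pep)

# Assumed convention: Ratio = Quality / length of the shorter sequence.
ratio = round(QUALITY / SHORTER_LENGTH, 3)

# Aligned, gap-free positions: 378 potato residues minus the 5 residues
# that sit opposite the single internal gap in the Arabidopsis sequence.
ALIGNED_POSITIONS = SHORTER_LENGTH - 5

# Inferred (not printed in the figure): pair counts consistent with the header.
IDENTICAL_PAIRS = 284
SIMILAR_PAIRS = 298

percent_identity = round(100 * IDENTICAL_PAIRS / ALIGNED_POSITIONS, 3)
percent_similarity = round(100 * SIMILAR_PAIRS / ALIGNED_POSITIONS, 3)

print(ratio, percent_identity, percent_similarity)
# Compare with the header: Ratio 3.929, Percent Identity 76.139,
# Percent Similarity 79.893.
```

On this reading, the 76.139% identity reported here for the Arabidopsis and potato sequences is a pairwise value over the aligned region only, not over the full length of either protein.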
FIG. 17A 

Adonis palaestina Ipi1 

1 attcatcttc agcagcgctg tcgtactctt tctatatctt cttccatcac 

51 taacagtagt cgccgacggt tgaatcggct attcgcctca acgtcaacta 

101 tgggtgaagt cactgatgct ggaatggatg ctgttcagaa gcggctcatg 

151 ttcgacgacg aatgtatttt ggtggatgag aatgacaagg tcgtcgggca 

201 tgattccaaa tacaactgtc atttgatgga aaagatagag gcagaaaatt 

251 tgcttcacag agccttcagt gttttcttgt tcaactcaaa atatgaattg 

301 cttcttcagc aacgatccgc cacaaaggta acattcccgc tcgtatggac 

351 aaacacatgt tgcagtcatc ctctctttcg tgattccgag ctcatagaag 

401 aaaattatct cggtgtacga aacgctgcac aaagaaagct tttagacgag 

451 ctaggcattc cagctgaaga tgtcccagtt gatgaattta ctcctcttgg 

501 tcgcattctt tacaaagctc catctgacgg caaatgggga gagcacgaat 

551 tggactatct cctatttatt gtccgagatg tgaaatacga tccaaaccca 

601 gatgaagttg ctgatgctaa gtatgttaat cgcgaggagt tgagagagat 

651 actgagaaaa gctgatgctg gtgaagaggg actcaagttg tctccttggt 

701 ttagattggt tgttgataac tttttgttca agtggtggga tcatgtagag 

751 cagggtacga ttaaggaagt tgctgacatg aaaactatcc acaagttgac 

801 ttaagaggac ttctctcctc tgttctacta tttgtttttt gctacaataa 

851 gtgggtggtg ataagcagtt tttctgtttt ctttaattta tggcttttga 

901 atttgcctcg atgttgaact tgtaacatat ttagacaaat atgagacctt 

951 gtaagttgaa tttgaggctg aatttatatt tttgggaaca taataatgtt 

1001 aa 
FIG. 17B 

Adonis palaestina Ipi2 

1 ttttaaagct ctttcgctcc accaccatca aagccagcca aatttctctg 

51 tacaaaagtt aaaaacaccg ctttgggctt tggcccctcc atatcggaat 

101 ccttgtttac gatacgcatc taaaccagta attctcggtt ttaatttgtt 

151 tcctaaatta ggcccctttc cggaatcccg agaattatgt cgtcgatcag 

201 gattaatcct ttatatagta tcttctccac caccactaaa acattatcag 

251 cttcgtgttc ttctcccgct gttcatcttc agcagcgttg tcgtactctt 

301 tctatttctt cttccatcac taacagtcct cgccgagggt tgaatcggct 

351 gttcgcctca acgtcgacta tgggtgaagt cgctgatgct ggtatggatg 

401 ccgtccagaa gcggcttatg ttcgacgatg aatgtatttt ggtggatgag 

451 aatgacaagg tcgtcggaca tgattccaaa tacaactgtc atttgatgga 

501 aaagatagag gcagaaaact tgcttcacag agccttcagt gttttcttat 

551 tcaactcaaa atacgagttg cttcttcagc aacgatctgc aacgaaggta 

601 acattcccgc tcgtatggac aaacacctgt tgcagccatc ccctcttccg 

651 tgattccgaa ctcatagaag aaaattttct cggggtacga aacgctgcac 

701 aaaggaagct tttagacgag ctaggcattc cagctgaaga cgtaccagtt 

751 gatgaattca ctcctcttgg tcgcattctt tacaaagctc catctgacgg 

801 aaaatgggga gagcacgaac tggactatct tctgtttatt gtccgagatg 

851 tgaaatacga tccaaaccca gatgaagttg ctgacgctaa gtacgttaat 

901 cgcgaggagt tgaaagagat actgagaaaa gctgatgcag gtgaagaggg 

951 aataaagttg tctccttggt ttagattggt tgtggataac tttttgttca 

1001 agtggtggga tcatgtagag gaggggaaga ttaaggacgt cgccgacatg 

1051 aaaactatcc acaagttgac ttaagagaaa gtctcttaag ttctactatt 

1101 tggtttttgc ttcaataagt ggatggtgat gagcagtttt tatgcttcct 

1151 ttaattttgg cttttcaatt tgctttatgt gttgaacttg taacatattt 

1201 agtcaaatat gagaccttgt gagttgaatt tgaggttata tttatagttt 

1251 tgggaacata aaaaaaaaaa 
FIG. 18A 

"'T^^^^ttt^ acctggcccg gcggcagtcc 
51 gatgccgcga tgcttcgttc gttgctcaga ggcctcacgc atatcccccg 
101 EgtgaaEtcc gEccagcagc ccagctgtgc acacgcgcga ctccagttta 
151 aqctcaggag catgcagatg acgctcatgc agcccagcat ctcagccaat 
201 ctgtcgcgcg ccgaggaccg cacagaccac atgaggggtg caagcacctg 
251 qgcaggcggg cagtcgcagg atgagctgat gctgaaggac gagtgcatct 
301 tggtggatgt tgaggacaac atcacaggcc atgccagcaa gctggagtgt 
351 cacaagttcc taccacatca gcctgcaggc ctgctgcacc gggccttctc 
401 tgtgttcctg tttgacgatc aggggcgact gctgctgcaa cagcgtgcac 
451 gcteaaaaat caccttccca agtgtgtgga cgaacacctg ctgcagccac 
501 cctttacatg ggcagacccc agatgaggtg gaccaactaa gccaggtggc 
551 cgacggalcl gtacctggcg caaaggctgc tgccatccgc aagttggagc 
601 acqagctggg gataccagcg caccagctgc cggcaagcgc gtttcgcttc 
651 ct?a?gcgtt tgcactactg tgccgcggac gtgcagccag ctgcgacaca 
701 atcaacactc tggggcgagc acgaaatgga ctacatcttg ttcatccggg 
751 ccalmtcac cttggcgccc aa^cctgacg aggtggacga agtcaggtac 
801 gtgacgcaag aggagctgcg gcagatgatg cagccggaca acgggctgca 
851 atggtcgccg tggtttcgca tcatcgccgc gcgcttcctt gagcgttggt 
901 gggctgacct ggacgcggcc ctaaacactg acaaacacga ggattgggga 
951 Scatc l?atoa?ga agcgtgaaag cagaagctgc aggatgtgaa 
100 gaHlcgtcat ggggtggaat tgcgtacttg gcagcttcgt atctcctttt 
inM trtaanactq aacctgcagt caggtcccac aaggtcaggt aaaatggctc 
1101 gaSgt Iccgtcactt tttgtcgcgt atactgaact ccaagaggtc 
1151 aaaaaaaaaa aaaaa 
FIG. 18B 

Haematococcus pluvialis Ipi2 

1 tggaacctgg cccggcggca gtccgatgcc gcgatgcttc gttcgttgct 

51 cagaggcctc acgcatatcc cgcgcgtgaa ctccgcccag cagcccagct 

101 gtgcacacgc gcgactccag tttaagctca ggagcatgca gctgcttgcc 

151 gaggaccgca cagaccacat gaggggtgca agcacctggg caggcgggca 

201 gtcgcaggat gagctgatgc tgaaggacga gtgcatctta gtggatgctg 

251 acgacaacat cacaggccat gccagcaagc tggagtgcca caaattccta 

301 ccacatcagc ctgcaggcct gctgcaccgg gccttctctg tgttcctgtt 

351 tgacgaccag gggcgactgc tgctgcaaca gcgtgcacgc tcaaaaatca 

401 ccttcccaag tgtgtggacg aacacctgct gcagccaccc tctacatggg 

451 cagaccccag atgaggtgga ccaactaagc caggtggccg acggcacagt 

501 acctggcgca aaagctgctg ccatccgcaa gttggagcac gagctgggga 

551 taccagcgca ccagctgccg gcaagcgcgt ttcgcttcct cacgcgtttg 

601 cactactgtg ccgcggacgt gcagccggct gcgacacaat cagcgctctg 

651 gggcgagcac gagatggact acatcttatt catccgggcc aacgtcacct 

701 tggcgcccaa ccctgacgag gtggacgaag tcaggtacgt gacgcaagag 

751 gagctgcggc agatgatgca gccggacaac gggttgcaat ggtcgccgtg 

801 gtttcgcatc atcgccgcgc gcttccttga gcgttggtgg gctgacctgg 

851 acgcggccct aaacactgac aaacacgagg attggggaac ggtgcatcac 

901 atcaacgaag cgtgaaggca gaagctgcag gatgtgaaga cacgtcatgg 

951 ggtggaattg cgtacttggc agcttcgtat ctcctttttc tgagactgaa 

1001 cctgcagagc tagagtcaat ggtgcatcat attcatcgtc tctcttttgt 

1051 tttagactaa tctgtagcta gagtcactga tgaatccttt acaactttca 

1101 aaaaaaaaa 
FIG. 19A 

Lactuca sativa Ipi1 

1 tgccaaaatg ttgaaatttc ccccttttaa aaccattgct accatgatct 

51 cttctccata ttcttccttc ttgctgcctc ggaaatcttc tttccctcca 

101 atgccgtctc tcgcagccgc tagtgttttc ctccaccctc tttcgtctgc 

151 cgctatgggc gattccagca tggatgctgt ccagcgacgt ctcatgttcg 

201 atgacgaatg cattttggtg gatgagaatg acaaagtggt tggccatgat 

251 actaaataca attgtcattt gatggagaag attgaaaagg gaaatatgct 

301 acacagagca ttcagtgtgt tcttgttcaa ctcgaaatat gaattactcc 

351 ttcagcaacg ttctgcaacc aaggtgactt tccctttggt atggacaaac 

401 acgtgttgca gccatccact atacagggag agtgaactta ttgacgaaaa 

451 cgcccttggg gtgaggaatg ctgcacagag gaagctcctg gatgaactcg 

501 gcatccctgg agcagatgtt ccggttgatg agttcactcc attgggtcgc 

551 attctataca aggccgcatc ggatggaaag tggggagaac atgaacttga 

601 ttacctgctg tttatggtac gtgatgttgg tttggatccg aacccagatg 

651 aagtgaaaga tgtaaaatat gtgaaccggg aagagctgaa ggaattggta 

701 aggaaggcgg atgctggtga agagggtgtg aagctgtccc cgtggttcaa 

751 attgattgtc gataatttct tgtttcagtg gtgggatcga ctccataagg 

801 gaaccctaac cgaagctatt gatatgaaaa caatccacaa actcacataa 

851 aaacactaca ctagtaggag agaggattat atgagatatt tgttatatgt 

901 gaaattgaaa ttcagatgaa tgcttgtatt tatttctatt tggacaaact 

951 tcaacttctt tttgctacct tatcagaaaa aaaaa 

FIG. 19B 

Lactuca sativa Ipi2 

1 tattcgcttc aaaatctctt ccattaactg ctcaaatctc caccttcgcc 

51 ggtcttaatc tccgccggcg cactttcacc accataaccg ccqccatggg 

101 tgacgattcc ggcatggacg ctgtccagag acgtctcatg tttgatgatg 

151 aatgcatttt ggttgatgaa aatgacaatg ttcttgggca tgataccaaa 

201 tacaattgtc acttgatgga gaagattgag aaagataatt tgcttcatag 

251 agcattcagt gtatttttat tcaattcaaa atacgaatta ctccttcagc 

301 aaaggtcaga aaccaaggtg acatttcctt tggtatggac aaacacctgt 

351 tgcagccatc cactatacag agaatcggag ttaattcccg aaaatqccct 

401 tggggtcaga aatgctgcac agaggaagct tctagatgaa ctcggtatcc 

451 ctgctgaaga tgttccagtt gatgagttca caactttagg tcgcatgttg 

501 tacaaggctc catctgatgg aaaatggggt gaacatgaag ttgattacct 

551 actcttcctc gtgcgtgacg ttgccgtgaa cccaaaccct gatgaggtgg 




601 

701 gglggacaac ?tct?gt?ca aatggtgqqa tcatgtccaa aaggggacac 

751 tcaatgaagc aattgacatg aaaaccattc ataagttgat atgaaaaatg 

801 gttaatattt atggtggtgg tttggaqcta ataatttgtq tgttcaagtc 

851 tcggtccttc tttttllaac gtttitittt tttctttta! tgggaqtgtt 

901 tattgtgtac ttgtaacgta ggccctttgg ttacgcttta agagtttaat 

951 aaagaaccac cgttaattta aaaaaaaaaa aaaaaaaa 
FIG. 20 

Chlamydomonas reinhardtii Ipi1 

(Note: the isomerase cDNA probably ends at ca. base 1103; the 
second half of the cDNA is similar to extensin and other 
hydroxyproline-rich structural proteins) 

1 qgcacgagct cgagtttgtt ttaccatgac atcgggaatt tggaagcttg 
51 aactacctca attactcaag taactcgcgg caacacattt cgcgcgccat 
101 cqctgttttc tctgctccag ctaccgagca gcattgcttt agatcgcttt 
151 gatgtcataa actcccact! atatgagatc cagtttcatc gagcccaagc 
201 ccagagcgca acctgtctta agccgcggca qggcgtccat gcgcctcgcg 
251 caaagccgtg ctctcgttgc gcgtgtcagc tccgccctgt ggccgggagc 
301 aggactttca caggctcaaa gcgttgcgqt gcgaatggcg agttcgtcaa 
351 ccfqggaagg cacgggcctg agccaggatg acttcatgca gcgggacgag 
401 tgcttggtgg tggacgagca ggaccggctg ctaggcaccg ccaacaagta 
451 cgactgccac cgcttcgagg cggccaaggg ccagccctgc ggccgcctgc 
501 accgcgcctt ctccgtgttc ctqttcagcc ccgacqgccg actgctgctg 
551 cagcagcgcg cagccagcaa ggtgacgttc ccgggtgtgt ggaccaacac 
601 ctgctgctcg cacccgctgg cgggccaggc gccggacgag gtggacctqc 
651 cggcggcggt agcctcgggc caggtgccgg gcatcaaggc ggcggcggtg 
701 cgcaagctgc agcacgagct ggggataccg ccggagcagg ttcccgcctc 
751 ctccttctcc ttcctcacgc gtctgcacta ctgcgccgcc gacaccgcca 
801 cgcacggccc ggcggcggag tggggcgagc acgaggtgga ctacgtgctq 
851 ttcgtgcggc cgcagcagcc cgtcagcctg caqcccaacc cagacgaggt 
901 ggacgccacg cgctacgtga cgctgccgga gcttcagtcc atgatggcgg 
951 accccggcct cagctggaqc ccctggttcc gcatcctggc cacacagccc 
1001 gccttcctgc ccgccfggtg gggcgacctg aagcggcgct gqcgcccggg 
1051 cggcagccga ctgtaggact ggggcaccat ccaccgcgtc atgtgaagaa 
1101 aaaggggaag caggggcggg agcgggggat gaatgggaat gtgaatgcga 
1151 ttgtgatgcg gcgtqggafg aggtclgaag acagggggaa aatcgggggg 
1201 cgggcgtgag cgtgtgtgta cgtgagcgac aaagccggga ggcgqaccgc 
1251 gcgatgggta catqtgtqtg cggagqgtcg gtgggtcggt cggttgcgcg 
1301 gcatagcgtg ttgtgtgfgt gcggclgcgc gggtatgtgg gcacccgggc 
1351 acggaggaga aggcacacgc agglggcgcg gaggtgfgtc afgfgccatg 
1401 ggcgggcctc actcctgglc glgcccagtg gtctcgtggg cagagtggca 
1451 gqggctgcac ccatatgagc ggcgcactgc cgcgctgggc taagtcctta 
1501 tcacttqgtg aggtggggcg aggtggctqt qggcggcggg cgcagtggca 
1551 gaaggacacg gtgtglgagc qgtggagctc !ggccgtgcc ggccgtgagg 
1601 ggcggatagc gatatgacgt tgtgcttggc cgctgtaatg cgggagaa^g 
1651 Igcaggccgc gagaagcggg cggtggcagg aggccgcagg ctgcagcacc 
1701 cqttggggag gtgccgcctg caggcgcggc gccgggcggg cctgagtaat 
1751 qqqcgcc!ga gtagtggcgg ccacaggagg cgcaggaggc agcagcagga 
1801 ggacgagctg gagggacccg ttggcaaccc aaggttgcgc g!gtaaca!a 
1851 gtggccatac aaaaaaaaaa aaaa 
FIG. 21A 

Tagetes erecta Ipi1 

1 ccaaaaacaa ctcaaatctc 
gactccggca tgqatgctqt 
cattttggtg gatgagtgtg 
attgtcactt qatqgaqaag 
ttcagcgttt ttctattcaa 
gtctqcaacc aaggtgacat 
gccatccact ctacagagaa 
cjtaagaaatg ctgcacagag 
tgaaqatgtt cccqttgatc 
aggctccatc tgatggaaag 
ttcatagtga gagacgttgc 
tatcaaatat gtganccang 
atqcggqqga ggagggtttg 
qataacttct tgttcaagtg 



51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 



tgaagcaatt gatatgaaaa 
tcaaccgaaa agttcaaqcc 
caattgtttt tttcttttaa 
ttcttgtctt ttgtgtgtaa 
gaaccattga tggtttgcaa 
aaaaaa 



ctccgtcgct 
tcagcgacgt 
acaatgtggt 
attgaaacag 
ttcaaaatac 
ttcctttagt 
tccgaqcttg 
gaaqctgttg 
agtttactcc 
tgqggagaac 
tgtaaacccg 
aagagttaaa 
aagctgtctc 
gtgqgatcat 
ccatacacaa 
taataattcg 
gaaqttttaa 
qattttgqqt 
tttcaagiic 



cttactccgc 
ctcatgtttq 
gggacatgat 
gtaaaatqct 
gagttacttc 
atggaccaac 
FIG. 23A 

Comparison using GAP program of the Genetics Computer Group 

Gap Weight: 50 Average match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 17392 Length: 1904 

Ratio: 9.411 Gaps: 3 

Percent Similarity: 95.331 Percent Identity: 95.331 
Match display thresholds for the alignment(s): 
| = IDENTITY : = 5 . = 1 
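The Ratio values printed in the GAP summaries of FIGS. 23, 24 and 28 are numerically consistent with Quality divided by the length of the shorter of the two aligned sequences. The sketch below checks that arithmetic. The function name `gap_ratio` is hypothetical, the definition is an assumption inferred from the printed numbers, and the sequence lengths are read from the final position numbers shown in each alignment.

```python
def gap_ratio(quality: float, len_a: int, len_b: int) -> float:
    """Quality divided by the length of the shorter sequence (assumed definition)."""
    return quality / min(len_a, len_b)


# (Quality, length of sequence 1, length of sequence 2, Ratio printed in the figure)
reported = {
    "FIG. 23": (17392, 1848, 1894, 9.411),
    "FIG. 24": (2728, 530, 530, 5.147),
    "FIG. 28": (1837, 525, 534, 3.499),
}

for name, (quality, len_a, len_b, printed) in reported.items():
    computed = round(gap_ratio(quality, len_a, len_b), 3)
    print(f"{name}: computed {computed}, printed {printed}")
```

Under this assumption the computed value agrees with the printed Ratio to three decimal places in all three figures.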



Adonis palaestina ε-cyclase #3 x Adonis palaestina ε-cyclase #5 

• • • • • 

1 gagagaaaaagagtgttatattaatgttactgtcgcattcttgcaacac. 49 

in miiii liiiiiiiiiMmiiiiiiiimiiiii M 

1 aaaggagtgttctattaatgttactgtcgcattcttgcaacact 44 



50 . atattcagactccattttcttgttttctcttcaaaacaacaaactaatg 98 

'WIN Hllllliljljl IMlJIlllllllllllllllllllI 0A 

itattcaaactccattttcttcttttctcttcaaaacaacaaactaatg 94 



45 ta 



* • • • • 

99 tga.cggagtatctagctatggaactacttggtgttcgcaacctcatctc 147 

111 I Willi] T 1 J I J if 1 1 1 J 1 1 J JTT J T J J 111 1 1 1 111 IJljr 

tqaqcaqaqtatctqqctatggaactacttqqtqttcqcaacctcatctc 



95 tgagcagagtatctggctatggaactacttggtgttcgcaacctcatctc 144 

• • • • • 

148 ttcttgccctgtctggacttttggaacaagaaaccttagtagttcaaaac 197 

,« 11 HI' 1 1 11U JTT 1 1 1 11 JTT 1 1 1 1 IT 1 1 1 1 1 1 1 IT I 111 I II Mil 

145 ttcttgccctgtgtggacttttggaacaagaaaccttagtagttcaaaac 194 



198 taqcttataacatacatcqatatqqttcttctt 



7 

txga 



atgg 



ccttc 



agagtagattttcaa 247 



gtagagtagattttcaa 244 



195 tagcttataacatacal 

2 f If tHffffiffi^ Z 

245 gtgagagctgatggtggaagcgggagtagaagttctgttgcttataaaga 294 
298 347 
gggttttgtggatgaagaggattttatcaaagctggtggttctgagcttt 344 



295 gggi 



* w » m * 

348 tgtttgtccaaatgcagcaaacaaagtctatggagaaacaggccaagctc 397 

,« ITlliT j i ! U 1 1 T i I T 1 1 1 1 1 1 1 1 TJ 1 J f JTT I T 1 1 1 1 1 TT 1 1 1 1 T K . ,„„ 

345 tgtttgtccaaatgcagcaaacaaagtctatggagaaacaggccaagctc 394 
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FIG. 23 B 
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^ -^-+--^-++--...+11 Ml JTTI ITITJ J IT^IT 1111)11) Ml 

645 ctgtccttattggtcgtgcatatggacgagttagtcgacatttgctacat 694 
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898 tqtccaaactqctta 
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ggtgatggccatagccttgtagttt 794 
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taqqtqqccctcqtqtttq 897 
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FIG. 23C 

• • • • • 

948 atcccaacttaatqqtatttatqqactacaqaqactatatqcaacaqaaa 997 
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atcccaacttaatggtatttatggactacagagactatatgcaacag. 

iliiiiiiiiiimiiii imiijiiiTmiiiiimiiiii... QM 
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, ^ TjlTl 1 1 1 1 1 IT I U1J111 J JT ITT ! 1 1 1 ijTl ] JTT t M M 1 1 ITMT 
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1098 ccatgcctttcgatctactgaagagaaaactaatqtcacqattqaagact 1147 
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1095 ccatgccattcgatctactgaagagaaaactgatgtcacgattgaagact 1144 

• • • ♦ • 
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n I in nil II n 

1195 tgttggtggttct 



TlTIIIIITIIIIIITIIIimiT 
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t1 
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1498 aaccagaacgttctttagaaccttcttccgcttgccaacttggatgtggt 1547 

Anc :mmn 11 iJll it 1 1 1 1 1 J i J 1 1 it n jTi s n i JlTTiiTj ; 1K/M 

1495 aaccagaacattctttagaaccttcttccgcttgccaacttggatgtggt 1544 



1543 qqggtttccttgggtcttcactatcatctttcgatcttgtattgttttcc 1597 

TTTTJ 111 I J JTTTJ I J 1 1 1 II IJ.1 1 1 IJ.1J. Ill J I J 1) 111111111 11M4 

1545 ggggtttccttgggtcttcactatcatctttcgatctcgtcttgttttcc 1594 

• • • • • 

1598 atgtacatgtttgttttqqccccgaacaqcatqaqqatqtcacttqtqaq 1647 



...... a Willi" I 1 1 1 1 1 ITI I JTITTI JTJ 1 1 1 J JTJT1T 1<U1 „ 

1595 atgtacatgtttgttttggcgccaaacagcatgaggatgtcacttgtgag 1644 

• • • • • 

1648 acatttgctttcagatccttctggtgcagttatggttaaagcttacctcg 1697 

1C/1C 1 1 11J.1TI J J 1 1 ITIJI I J J I JTTJTI ITJ 1 1 JTTJ I ITIIIIIIIIT lfiM 

1645 acatttgctttcagatccttctggtgcagttatggtaagagcttacctcg 1694 

• • • • • 

1698 aaaggtaatc. . .tgttttatgaaactatagtgtctcattaaataaatga 1744 

mi ill li 11 llijjnil liiliyn 1 1 ill i ill i ,™ 

1695 aaaggtagtctcatctattattaaactctagtgtttcaccaaataaatga 1744 

• * • • • 

1745 qqatccttcgtatatgtatatgatcatctctatgtatatcctatattcta 1794 



1745 



qqatccttcgtatatgtatatgatcatctctatgtatatcctatattct 

TTIIimiT II lIlllllllllllllllllTllllllli II Ml 

ggatccttcgaatgtgtatatgatcatctctatgtatatcctgtactct 



a 1794 



1795 atctcataaagtaatcgaaaattcattgataqaaaaaaaaaaaaaaaaaa 1844 



IIIIIIIIITJII T 

tctcataaagtaaatgc 



1795 atctcataaagtaaatgccgggtttgatattgttgtgtcaaaccggccaa 1844 

1845 aaaa ! ". ' 1 * 1848 

1845 tgitataaagtaaatttattgatacaaaagtagtttttttttttaaaaaa 1894 



FIG. 23D 
FIG. 24A 

GAP program of Genetics Computer Group 
blosum62.cmp 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 
Quality: 2728 Length: 530 

Ratio: 5.147 Gaps: 0 

Percent Similarity: 99.623 Percent Identity: 99.057 
Match display thresholds for the alignment(s): 
| = IDENTITY : = 2 . = 1 

Adonis palaestina ε-cyclase #3 x Adonis palaestina ε-cyclase 

1 MELLGVRNLISSCPVWTFGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 50 

1 MELLGVRNLISSCPVWTFGTRNLSSSKLAYNI^ 50 

51 SGSRTSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 

IIILMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

51 SGSRSSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 
101 IPFGESVMDLVVIGCGPAGLSLAAEAAKLGLKVGLIGPDLPFTNNYGVWE 150 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIMIIIIIIIIIIMIII 

101 IPFGESVMDLVVIGCGPAGLSLAAEAAKLGLKVGLIGPDLPFTNNYGVWE 150 

151 DEFKDLGLERCIEHAWKDTIVYLDNDAPVLIGRAYGRVSRHLLHEELLKR 200 

IIIIIIIIIIIIIIIIMMIMIIIIIIIIIIIIIMIIIIIIIIIIII 
151 DEFKDLGLERCIEHAWKDTIVYLDNDAPVLIGRAYGRVSRHLLHEELLKR 200 

201 CVESGVSYLNSKVERITEAGDGHSLVVCENDIFIPCRLATVASGAASGKL 250 

IIIIIIIIMIMIIIIIIIIIIIIIIIhlllllllllllllllllll 

201 CVESGVSYLDSKVERITEAGDGHSLVVCENEIFIPCRLATVASGAASGKL 250 

251 LEYEVGGPRVCVQTAYGVEVEVENNPYDPNLMVFMDYRDYMQQKLQCSEE 300 

llllllllllllllllllllllllllllllllllllllllllllllllll 
251 LEYEVGGPRVCVQTAYGVEVEVENNPYDPNLMVFMDYRDYMQQKLQCSEE 300 

301 EYPTFLYVMPMSPTRLFFEETCLASKDAMPFDLLKRKLMSRLKTLGIQVT 350 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 
301 EYPTFLYVMPMSPTRLFFEETCLASKDAMPFDLLKRKLMSRLKTLGIQVT 350 
351 KIYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSLSEAPK 400 

|:||||||||||||||||||||||||||||||||||||||||||||||||
351 KVYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSLSEAPK 400 

401 YASVIAKILKQDNSAYVVSGQSSAVNISMQAWSSLWPKERKRQRAFFLFG 450 
401 YASVIAkIlKQDNSM 450 

451 LELIVQLDIOTRTFFRTFFRLPTVM^WGFLGSSLSSFDLVLFSNIYMFVL 500 
451 LELIVQIJtfEAT^^ 500 

501 APNSMRMSLVRHLLSDPSGAVMVKAYLER* 530 
501 AFlllSM^ 530 

FIG. 24B 



0 
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FIG. 28A 

GAP of: Arabidopsis epsilon cyclase to Lettuce epsilon cyclase 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1837 Length: 534 

Ratio: 3.499 Gaps: 3 

Percent Similarity: 76.381 Percent Identity: 69.905 
Match display thresholds for the alignment(s): 
| = IDENTITY : = 2 . = 1 

Arabidopsis x Lettuce 

1 MECVGARNF.AAMAVSTFPSW. . .SCRRKFPVVKRYSYRNIRFGLCSVRA 46 

III INI I III I I • -Ml ..|. : |: 
1 MECFGARNMTATMAVFTCPRFTDCNIRHKFSLLKQRRFTNLSASSSLRQI 50 

47 SGGGSSGSESCVAVREDFADEEDFVKAGGSEILFVQMQQNKDMDEQSKLV 96 

I llllhlllllll: Mill. I |: llll 

51 KCSAKSDRCVVDKQGISVADEEDYVKAGGSELFFVQMQRTKSMESQSKLS 100 

97 DKLPPISIGDGALDHVVIGCGPAGLALAAESAKLGLKVGLIGPDLPFTNN 146 

:M I II- M lllllllllllllllllllll lllllllllllll 
101 EKLAQIPIGNCILDLVVIGCGPAGLALAAESAKLGLNVGLIGPDLPFTNN 150 

147 YGVWEDEFNDLGLQKCIEHVWRETIVYLDDDKPITIGRAYGRVSRRLLHE 196 

Hlhlll III: MM |::|:||||| I |||||||| I MM 
151 YGVWQDEFIGLGLEGCIEHSWKDTLVYLDDADPIRIGRAYGRVHRDLLHE 200 

197 ELLRRCVESGVSYLSSKVDSITEASDGLRLVACDDNNAIPCRLATVASGA 246 

MIMIIMMMIIMI: MM -I |: h I IMIIIIIIIM 

201 ELLRRCVESGVSYLSSKVERITEAPNGYSLIECEGNITIPCRLATVASGA 250 
247 ASGKLLQYEVGGPRVCVQTAYGVEVEVENSPYDPDQMVFMDYRDYTNEKV 296 

oc llll hIMIIIIIIIIMhIIMII.IIMI MN'I ••• I 

251 ASGKFLEYELGGPRVCVQTAYGIEVEVENNPYDPDLMVFMDYRDFSKHKP 300 

297 RSLEAEYPTFLYAMPMTKSRLFFEETCLASKDVMPFDLLKTKLMLRLDTL 346 

1 1 1 1 - ! I ! 1 1 1 I I- .::MMMII|:: ll|.|||.||| II : ' 
301 ESLEAKYPTFLYVMAMSPTKIFFEETCLASREAMPFNLLKSKLMSRLKAM 350 
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FIG. 28B 



347 GIRILKTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 396 

Mil :|IIIIIIMMMIIIIimiMIIIIIIIMIMIIimil 

351 GIRITRTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 400 
397 SEAPKYASVIAEILREETTKQINS NISRQAWDTLWPPERKRQRAF 441 

Mil 1 1 - 1 1 1 - ! I h = -h- I llhllhllll 1 1 1 1 M 1 1 

401 SEAPNYAAVIAKILRQDQSKEMISLGKYTNISKQAWETLWPLERKRQRAF 450 

442 FIFGLALIVQFDTEGIRSFRCT^ 491 

Mill. II I II MIIIIIMIMM IIIIM-I l|::|lll 
451 FLFGLSHIVLXDLEGTRTFFRTFFRLPKIMJWGFLGSSLSSTDLIIFALY 500 

492 MFVISPNNLRKGLINHLISDPTGATMIKTYLKV* 525 

IIIM-II |: IMIIIIIIM II :| 
501 MFVIAPHSLRMELVRHLLSDPTGATMVKAYLTI* 534 
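The Percent Identity figure in a GAP summary can be illustrated with a short sketch that counts identical residues over the alignment columns in which neither sequence has a gap. This is an assumed reading of how the percentage is formed, not a statement of the GAP program's internals. The printed 69.905% and 76.381% for FIG. 28 are arithmetically consistent with 367 and 401 of 525 gap-free columns, which is an inference from the numbers rather than something the figure states. The function name `percent_identity` is hypothetical; the two strings are the first alignment block of FIG. 28A, with gaps written as `.`.

```python
def percent_identity(a: str, b: str, gap: str = ".") -> float:
    """Percent of gap-free alignment columns holding identical residues."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only the columns where neither sequence carries a gap character.
    pairs = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not pairs:
        raise ValueError("no gap-free columns to compare")
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# First 50 alignment columns of FIG. 28A (Arabidopsis residues 1-46, lettuce 1-50).
arabidopsis = "MECVGARNF.AAMAVSTFPSW...SCRRKFPVVKRYSYRNIRFGLCSVRA"
lettuce = "MECFGARNMTATMAVFTCPRFTDCNIRHKFSLLKQRRFTNLSASSSLRQI"

print(round(percent_identity(arabidopsis, lettuce), 1))

# Arithmetic check of the whole-alignment figures under the assumption above.
print(round(100 * 367 / 525, 3), round(100 * 401 / 525, 3))
```

The N-terminal block is far less conserved than the alignment as a whole, so its local value is expected to fall well below the overall figure.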



SUBSTITUTE SHEET (RULE 26) 



WO 99/63055 

PCT/US99/12121 

SEQUENCE LISTING 



<110> CUNNINGHAM JR., FRANCIS X. 
SUN, ZAIREN 

<120> GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM AND 
METHODS OF USE THEREOF 

<150> 09/088,725 
<151> 1998-06-02 

<160> 61 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 1860 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 

<222> (109)..(1680) 
<400> 1 

ACAAAAGGAA ATAATTAGAT TCCTCTTTCT GCTTGCTATA CCTTGATAGA ACAATATAAC 60 

AATGGTGTAA GTCTTCTCGC TGTATTCGAA ATTATTTGGA GGAGGAAA ATG GAG TGT 117 

Met Glu Cys 
1 

GTT GGG GCT AGG AAT TTC GCA GCA ATG GCG GTT TCA ACA TTT CCG TCA 165 
Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr Phe Pro Ser 
5 10 15 

TGG AGT TGT CGA AGG AAA TTT CCA GTG GTT AAG AGA TAC AGC TAT AGG 213 
Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr Ser Tyr Arg 
20 25 30 35 

AAT ATT CGT TTC GGT TTG TGT AGT GTC AGA GCT AGC GGC GGC GGA AGT 261 
Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly Gly Gly Ser 
40 45 50 

TCC GGT AGT GAG AGT TGT GTA GCG GTG AGA GAA GAT TTC GCT GAC GAA 309 
Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe Ala Asp Glu 
55 60 65 

GAA GAT TTT GTG AAA GCT GGT GGT TCT GAG ATT CTA TTT GTT CAA ATG 357 
Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe Val Gln Met 
70 75 80 

CAG CAG AAC AAA GAT ATG GAT GAA CAG TCT AAG CTT GTT GAT AAG TTG 405 
Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val Asp Lys Leu 
85 90 95 

CCT CCT ATA TCA ATT GGT GAT GGT GCT TTG GAT CAT GTG GTT ATT GGT 453 
Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val Val Ile Gly 
100 105 110 115 

TGT GGT CCT GCT GGT TTA GCC TTG GCT GCA GAA TCA GCT AAG CTT GGA 501 
Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly 
120 125 130 

TTA AAA GTT GGA CTC ATT GGT CCA GAT CTT CCT TTT ACT AAC AAT TAC 549 
Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr 
135 140 145 

GGT GTT TGG GAA GAT GAA TTC AAT GAT CTT GGG CTG CAA AAA TGT ATT 597 
Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln Lys Cys Ile 
150 155 160 

GAG CAT GTT TGG AGA GAG ACT ATT GTG TAT CTG GAT GAT GAC AAG CCT 645 
Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp Asp Lys Pro 
165 170 175 

ATT ACC ATT GGC CGT GCT TAT GGA AGA GTT AGT CGA CGT TTG CTC CAT 693 
Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His 
180 185 190 195 

GAG GAG CTT TTG AGG AGG TGT GTC GAG TCA GGT GTC TCG TAC CTT AGC 741 
Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser 
200 205 210 

TCG AAA GTT GAC AGC ATA ACA GAA GCT TCT GAT GGC CTT AGA CTT GTT 789 
Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu Arg Leu Val 
215 220 225 

GCT TGT GAC GAC AAT AAC GTC ATT CCC TGC AGG CTT GCC ACT GTT GCT 837 
Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala Thr Val Ala 
230 235 240 

TCT GGA GCA GCT TCG GGA AAG CTC TTG CAA TAC GAA GTT GGT GGA CCT 885 
Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val Gly Gly Pro 
245 250 255 

AGA GTC TGT GTG CAA ACT GCA TAC GGC GTG GAG GTT GAG GTG GAA AAT 933 
Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn 
260 265 270 275 

AGT CCA TAT GAT CCA GAT CAA ATG GTT TTC ATG GAT TAC AGA GAT TAT 981 
Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr Arg Asp Tyr 
280 285 290 

ACT AAC GAG AAA GTT CGG AGC TTA GAA GCT GAG TAT CCA ACG TTT CTG 1029 
Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu 
295 300 305 

TAC GCC ATG CCT ATG ACA AAG TCA AGA CTC TTC TTC GAG GAG ACA TGT 1077 
Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys 
310 315 320 

TTG GCC TCA AAA GAT GTC ATG CCC TTT GAT TTG CTA AAA ACG AAG CTC 1125 
Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu 
325 330 335 

ATG TTA AGA TTA GAT ACA CTC GGA ATT CGA ATT CTA AAG ACT TAC GAA 1173 
Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys Thr Tyr Glu 
340 345 350 355 

GAG GAG TGG TCC TAT ATC CCA GTT GGT GGT TCC TTG CCA AAC ACC GAA 1221 
Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu 
360 365 370 

CAA AAG AAT CTC GCC TTT GGT GCT GCC GCT AGC ATG GTA CAT CCC GCA 1269 
Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala 
375 380 385 

ACA GGC TAT TCA GTT GTG AGA TCT TTG TCT GAA GCT CCA AAA TAT GCA 1317 
Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala 
390 395 400 

TCA GTC ATC GCA GAG ATA CTA AGA GAA GAG ACT ACC AAA CAG ATC AAC 1365 
Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys Gln Ile Asn 
405 410 415 

AGT AAT ATT TCA AGA CAA GCT TGG GAT ACT TTA TGG CCA CCA GAA AGG 1413 
Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg 
420 425 430 435 

AAA AGA CAG AGA GCA TTC TTT CTC TTT GGT CTT GCA CTC ATA GTT CAA 1461 
Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln 
440 445 450 

TTC GAT ACC GAA GGC ATT AGA AGC TTC TTC CGT ACT TTC TTC CGC CTT 1509 
Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu 
455 460 465 

CCA AAA TGG ATG TGG CAA GGG TTT CTA GGA TCA ACA TTA ACA TCA GGA 1557 
Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly 
470 475 480 

GAT CTC GTT CTC TTT GCT TTA TAC ATG TTC GTC ATT TCA CCA AAC AAT 1605 
Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser Pro Asn Asn 
485 490 495 

TTG AGA AAA GGT CTC ATC AAT CAT CTC ATC TCT GAT CCA ACC GGA GCA 1653 
Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro Thr Gly Ala 
500 505 510 515 

ACC ATG ATA AAA ACC TAT CTC AAA GTA TGATTTACTT ATCAACTCTT 1700 
Thr Met Ile Lys Thr Tyr Leu Lys Val 
520 

AGGTTTGTGT ATATATATGT TGATTTATCT GAATAATCGA TCAAAGAATG GTATGTGGGT 1760 
TACTAGGAAG TTGGAAACAA ACATGTATAG AATCTAAGGA GTGATCGAAA TGGAGATGGA 1820 
AACGAAAAGA AAAAAATCAG TCTTTGTTTT GTGGTTAGTG 1860 
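The CDS feature of SEQ ID NO: 1 spans positions 109 to 1680, so the coding region is 1572 nucleotides, or 524 codons, which matches the 524-residue length given for SEQ ID NO: 2. A minimal sketch of the codon-by-codon translation is shown below, assuming the standard genetic code. The helper name `translate` and the short test fragments are illustrative only; the fragments are copied from the first nine codons and the last two codons plus stop of the listing above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string to one-letter residues, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the CDS (positions 109-135 of SEQ ID NO: 1).
print(translate("ATG GAG TGT GTT GGG GCT AGG AAT TTC"))

# Last two codons followed by the stop codon at positions 1681-1683.
print(translate("AAA GTA TGA"))

# Codon count implied by the <222> (109)..(1680) feature.
print((1680 - 109 + 1) // 3)
```

The same loop applied to the full 109..1680 interval would be expected to reproduce SEQ ID NO: 2, provided the transcribed nucleotides are free of scanning errors.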



<210> 2 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 
1 5 10 15 

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly 
35 40 45 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe 
65 70 75 80 

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val 
85 90 95 

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val 
100 105 110 

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 120 125 

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln 
145 150 155 160 

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp 
165 170 175 

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 205 

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala 
225 230 235 240 

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu 
260 265 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
305 310 315 320 

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
325 330 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys 
340 345 350 

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro 
355 360 365 

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys 
405 410 415 

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro 
420 425 430 

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 445 

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe 
450 455 460 

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser 
485 490 495 

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val 
515 520 
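The sequence listing uses three-letter residue codes, whereas the alignment figures use one-letter codes. A small conversion sketch makes it easier to cross-check the two, for example the opening of SEQ ID NO: 2 against the Arabidopsis line of FIG. 28A. The names `THREE_TO_ONE` and `to_one_letter` are hypothetical helpers; the mapping itself is the standard IUPAC assignment for the twenty common amino acids.

```python
# Standard three-letter to one-letter amino acid codes (twenty common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string."""
    # A KeyError here flags a token that is not a valid residue code,
    # which is a useful signal when checking scanned listings.
    return "".join(THREE_TO_ONE[token] for token in residues.split())


# First sixteen residues of SEQ ID NO: 2.
first_row = "Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr"
print(to_one_letter(first_row))
```

The converted string can then be compared with the corresponding figure line once the alignment's gap characters are removed.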

<210> 3 
<211> 956 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 3 

GCTCTTTCTC CTCCTCCTCT ACCGATTTCC GACTCCGCCT CCCGAAATCC TTATCCGGAT 60 
TCTCTCCGTC TCTTCGATTT AAACGCTTTT CTGTCTGTTA CGTCGTCGAA GAACGGAGAC 120 
AGAATTCTCC GATTGAGAAC GATGAGAGAC CGGAGAGCAC GAGCTCCACA AACGCTATAG 180 
ACGCTGAGTA TCTGGCGTTG CGTTTGGCGG AGAAATTGGA GAGGAAGAAA TCGGAGAGGT 240 
CCACTTATCT AATCGCTGCT ATGTTGTCGA GCTTTGGTAT CACTTCTATG GCTGTTATGG 300 
CTGTTTACTA CAGATTCTCT TGGCAAATGG AGGGAGGTGA GATCTCAATG TTGGAAATGT 360 
TTGGTACATT TGCTCTCTCT GTTGGTGCTG CTGTTGGTAT GGAATTCTGG GCAAGATGGG 420 
CTCATAGAGC TCTGTGGCAC GCTTCTCTAT GGAATATGCA TGAGTCACAT CACAAACCAA 480 
GAGAAGGACC GTTTGAGCTA AACGATGTTT TTGCTATAGT GAACGCTGGT CCAGCGATTG 540 
GTCTCCTCTC TTATGGATTC TTCAATAAAG GACTCGTTCC TGGTCTCTGC TTTGGCGCCG 600 
GGTTAGGCAT AACGGTGTTT GGAATCGCCT ACATGTTTGT CCACGATGGT CTCGTGCACA 660 
AGCGTTTCCC TGTAGGTCCC ATCGCCGACG TCCCTTACCT CCGAAAGGTC GCCGCCGCTC 720 
ACCAGCTACA TCACACAGAC AAGTTCAATG GTGTACCATA TGGACTGTTT CTTGGACCCA 780 
AGGAATTGGA AGAAGTTGGA GGAAATGAAG AGTTAGATAA GGAGATTAGT CGGAGAATCA 840 
AATCATACAA AAAGGCCTCG GGCTCCGGGT CGAGTTCGAG TTCTTGACTT TAAACAAGTT 900 
TTAAATCCCA AATTCTTTTT TTGTCTTCTG TCATTATGAT CATCTTAAGA CGGTCT 956 



<210> 4 
<211> 294 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 4 

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser 
1 5 10 15 

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
20 25 30 

Tyr Val Val Glu Glu Arg Arg Gln Asn Ser Pro Ile Glu Asn Asp Glu 
35 40 45 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala Ile Asp Ala Glu Tyr Leu 
50 55 60 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
65 70 75 80 

Thr Tyr Leu Ile Ala Ala Met Leu Ser Ser Phe Gly Ile Thr Ser Met 
85 90 95 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gln Met Glu Gly Gly 
100 105 110 

Glu Ile Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
115 120 125 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
130 135 140 

Trp His Ala Ser Leu Trp Met Asn His Glu Ser His His Lys Pro Arg 
145 150 155 160 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala Ile Val Asn Ala Gly 
165 170 175 

Pro Ala Ile Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
180 185 190 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly Ile Thr Val Phe Gly Ile 
195 200 205 

Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val 
210 215 220 

Gly Pro Ile Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
225 230 235 240 

Gln Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
245 250 255 

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
260 265 270 

Lys Glu Ile Ser Arg Arg Ile Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
275 280 285 

Gly Ser Ser Ser Ser Ser 
290 

<210> 5 
<211> 162 
<212> PRT 

<213> Alcaligenes sp. 
<400> 5 

Met Thr Gln Phe Leu Ile Val Val Ala Thr Val Leu Val Met Glu Leu 
1 5 10 15 

Thr Ala Tyr Ser Val His Arg Trp Ile Met His Gly Pro Leu Gly Trp 
20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 40 45 

Asn Asp Leu Tyr Gly Val Val Phe Ala Val Leu Ala Thr Ile Leu Phe 
50 55 60 

Thr Val Gly Ala Tyr Trp Trp Pro Val Leu Trp Trp Ile Ala Leu Gly 
65 70 75 80 

Met Thr Val Tyr Gly Leu Ile Tyr Phe Ile Leu His Asp Gly Leu Val 
85 90 95 

His Gln Arg Trp Pro Phe Arg Tyr Ile Pro Arg Arg Gly Tyr Phe Arg 
100 105 110 

Arg Leu Tyr Gln Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 120 125 

His Cys Val Ser Phe Gly Phe Ile Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gln Asp Leu Lys Arg Ser Gly Val Leu Arg Pro Gln Asp Glu Arg 
145 150 155 160 

Pro Ser 

<210> 6 
<211> 175 
<212> PRT 

<213> Erwinia herbicola 
<400> 6 

Met Leu Asn Ser Leu Ile Val Ile Leu Ser Val Ile Ala Met Glu Gly 
1 5 10 15 

Ile Ala Ala Phe Thr His Arg Tyr Ile Met His Gly Trp Gly Trp Arg 
20 25 30 

Trp His Glu Ser His His Thr Pro Arg Lys Gly Val Phe Glu Leu Asn 
35 40 45 

Asp Leu Phe Ala Val Val Phe Ala Gly Val Ala Ile Ala Leu Ile Ala 
50 55 60 

Val Gly Thr Ala Gly Val Trp Pro Leu Gln Trp Ile Gly Cys Gly Met 
65 70 75 80 

Thr Val Tyr Gly Leu Leu Tyr Phe Leu Val His Asp Gly Leu Val His 
85 90 95 

Gln Arg Trp Pro Phe His Trp Ile Pro Arg Arg Gly Tyr Leu Lys Arg 
100 105 110 

Leu Tyr Val Ala His Arg Leu His His Ala Val Arg Gly Arg Glu Gly 
115 120 125 

Cys Val Ser Phe Gly Phe Ile Tyr Ala Arg Lys Pro Ala Asp Leu Gln 
130 135 140 

Ala Ile Leu Arg Glu Arg His Gly Arg Pro Pro Lys Arg Asp Ala Ala 
145 150 155 160 

Lys Asp Arg Pro Asp Ala Ala Ser Pro Ser Ser Ser Ser Pro Glu 
165 170 175 

<210> 7 
<211> 175 
<212> PRT 

<213> Erwinia uredovora 
<400> 7 

Met Leu Trp Ile Trp Asn Ala Leu Ile Val Phe Val Thr Val Ile Gly 
1 5 10 15 

Met Glu Val Ile Ala Ala Leu Ala His Lys Tyr Ile Met His Gly Trp 
20 25 30 

Gly Trp Gly Trp His Leu Ser His His Glu Pro Arg Lys Gly Ala Phe 
35 40 45 

Glu Val Asn Asp Leu Tyr Ala Val Val Phe Ala Ala Leu Ser Ile Leu 
50 55 60 

Leu Ile Tyr Leu Gly Ser Thr Gly Met Trp Pro Leu Gln Trp Ile Gly 
65 70 75 80 

Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met Val His Asp Gly 
85 90 95 

Leu Val His Gln Arg Trp Pro Phe Arg Tyr Ile Pro Arg Lys Gly Tyr 
100 105 110 

Leu Lys Arg Leu Tyr Met Ala His Arg Met His His Ala Val Arg Gly 
115 120 125 

Lys Glu Gly Cys Val Ser Phe Gly Phe Leu Tyr Ala Pro Pro Leu Ser 
130 135 140 

Lys Leu Gln Ala Thr Leu Arg Glu Arg His Gly Ala Arg Ala Gly Ala 
145 150 155 160 

Ala Arg Asp Ala Gln Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys 
165 170 175 

<210> 8 
<211> 162 
<212> PRT 

<213> Agrobacterium aurantiacum 
<400> 8 

Met Thr Asn Phe Leu Ile Val Val Ala Thr Val Leu Val Met Glu Leu 
1 5 10 15 

Thr Ala Tyr Ser Val His Arg Trp Ile Met His Gly Pro Leu Gly Trp 
20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 40 45 

Asn Asp Leu Tyr Gly Leu Val Phe Ala Val Ile Ala Thr Val Leu Phe 
50 55 60 

Thr Val Gly Trp Ile Trp Ala Pro Val Leu Trp Trp Ile Ala Leu Gly 
65 70 75 80 

Met Thr Val Tyr Gly Leu Ile Tyr Phe Val Leu His Asp Gly Leu Val 
85 90 95 

His Trp Arg Trp Pro Phe Arg Tyr Ile Pro Arg Lys Gly Tyr Ala Arg 
100 105 110 

Arg Leu Tyr Gln Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 120 125 

His Cys Val Ser Phe Gly Phe Ile Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gln Asp Leu Lys Met Ser Gly Val Leu Arg Ala Glu Ala Gln Glu 
145 150 155 160 

Arg Thr 



<210> 9 
<211> 954 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 9 

CCACGGGTCC GCCTCCCCGT TTTTTTCCGA TCCGATCTCC GGTGCCGAGG ACTCAGCTGT 60 
TTGTTCGCGC TTTCTCAGCC GTCACCATGA CCGATTCTAA CGATGCTGGA ATGGATGCTG 120 
TTCAGAGACG ACTCATGTTT GAAGACGAAT GCATTCTCGT TGATGAAAAT AATCGTGTGG 180 
TGGGACATGA CACTAAGTAT AACTGTCATC TGATGGAAAA GATTGAAGCT GAGAATTTAC 240 
TTCACAGAGC TTTCAGTGTG TTTTTATTCA ACTCCAAGTA TGAGTTGCTT CTCCAGCAAC 300 
GGTCAAAAAC AAAGGTTACT TTCCCACTTG TGTGGACAAA CACTTGTTGC AGCCATCCTC 360 
TTTACCGTGA ATCCGAGCTT ATTGAAGAGA ATGTGCTTGG TGTAAGAAAT GCCGCACAAA 420 
GGAAGCTTTT CGATGAGCTC GGTATTGTAG CAGAAGATGT ACCAGTCGAT GAGTTCACTC 480 
CCTTGGGACG CATGCTTTAC AAGGCACCTT CTGATGGGAA ATGGGGAGAG CACGAAGTTG 540 
ACTATCTACT CTTCATCGTG CGGGATGTGA AGCTTCAACC AAACCCAGAT GAAGTGGCTG 600 
AGATCAAGTA CGTGAGCAGG GAAGAGCTTA AGGAGCTGGT GAAGAAAGCA GATGCTGGCG 660 
ATGAAGCTGT GAAACTATCT CCATGGTTCA GATTGGTGGT GGATAATTTC TTGATGAAGT 720 
GGTGGGATCA TGTTGAGAAA GGAACTATCA CTGAAGCTGC AGACATGAAA ACCATTCACA 780 
AGCTCTGAAC TTTCCATAAG TTTTGGATCT TCCCCTTCCC ATAATAAAAT TAAGAGATGA 840 
GACTTTTATT GATTACAGAC AAAACTGGCA ACAAAATCTA TTCCTAGGAT TTTTTTTTGC 900 
TTTTTATTTA CTTTTGATTC ATCTCTAGTT TAGTTTTCAT CTTAAAAAAA AAAA 954 

<210> 10 
<211> 996 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 10 

CACCAATGTC TGTTTCTTCT TTATTTAATC TCCCATTGAT TCGCCTCAGA TCTCTCGCTC 60 

TTTCGTCTTC TTTTTCTTCT TTCCGATTTG CCCATCGTCC TCTGTCATCG ATTTCACCGA 120 

GAAAGTTACC GAATTTTCGT GCTTTCTCTG GTACCGCTAT GACAGATACT AAAGATGCTG 180 
GTATGGATGC TGTTCAGAGA CGTCTCATGT TTGAGGATGA ATGCATTCTT GTTGATGAAA 240 
CTGATCGTGT TGTGGGGCAT GTCAGCAAGT ATAATTGTCA TCTGATGGAA AATATTGAAG 300 
CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTCGAAG TATGAGTTGC 360 
TTCTCCAGCA AAGGTCAAAC ACAAAGGTTA CGTTCCCTCT AGTGTGGACT AACACTTGTT 420 
GCAGCCATCC TCTTTACCGT GAATCAGAGC TTATCCAGGA CAATGCACTA GGTGTGAGGA 480 
ATGCTGCACA AAGAAAGCTT CTCGATGAGC TTGGTATTGT AGCTGAAGAT GTACCAGTCG 540 
ATGAGTTCAC TCCCTTGGGA CGTATGCTGT ACAAGGCTCC TTCTGATGGC AAATGGGGAG 600 
AGCATGAACT TGATTACTTG CTCTTCATCG TGCGAGACGT GAAGGTTCAA CCAAACCCAG 660 
ATGAAGTAGC TGAGATCAAG TATGTGAGCC GGGAAGAGCT GAAGGAGCTG GTGAAGAAAG 720 
CAGATGCAGG TGAGGAAGGT TTGAAACTGT CACCATGGTT CAGATTGGTG GTGGACAATT 780 
TCTTGATGAA GTGGTGGGAT CATGTTGAGA AAGGAACTTT GGTTGAAGCT ATAGACATGA 840 
AAACCATCCA 




APATPTTTTT 


1 Innnol 111 


1 AAA1 LAA I L> 


AACTTTCTCT 


900 


TCATCATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT GTGAGACACT TACAAAACTT 960 
CCAAGCACCT CAGGCAATAA TAAAGTTTGC GGCCGC 996 


<210> 11 
<211> 1165 
<212> DNA 

<213> Haematococcus pluvialis 










<400> 11 
CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC GATGCCGCGA 60 
TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCCCG CGTGAACTCC GCCCAGCAGC 120 
CCAGCTGTGC ACACGCGCGA CTCCAGTTTA AGCTCAGGAG CATGCAGATG ACGCTCATGC 180 
AGCCCAGCAT CTCAGCCAAT CTGTCGCGCG CCGAGGACCG CACAGACCAC ATGAGGGGTG 240 
CAAGCACCTG GGCAGGCGGG CAGTCGCAGG ATGAGCTGAT GCTGAAGGAC GAGTGCATCT 300 
TGGTGGATGT TGAGGACAAC ATCACAGGCC ATGCCAGCAA GCTGGAGTGT CACAAGTTCC 360 
TACCACATCA GCCTGCAGGC CTGCTGCACC GGGCCTTCTC TGTGTTCCTG TTTGACGATC 420 
AGGGGCGACT GCTGCTGCAA CAGCGTGCAC GCTCAAAAAT CACCTTCCCA AGTGTGTGGA 480 
CGAACACCTG CTGCAGCCAC CCTTTACATG GGCAGACCCC AGATGAGGTG GACCAACTAA 540 
GCCAGGTGGC CGACGGAACA GTACCTGGCG CAAAGGCTGC TGCCATCCGC AAGTTGGAGC 600 
ACGAGCTGGG GATACCAGCG CACCAGCTGC CGGCAAGCGC GTTTCGCTTC CTCACGCGTT 660 
TGCACTACTG TGCCGCGGAC GTGCAGCCAG CTGCGACACA ATCAGCGCTC TGGGGCGAGC 720 
ACGAAATGGA CTACATCTTG TTCATCCGGG CCAACGTCAC CTTGGCGCCC AACCCTGACG 780 
AGGTGGACGA AGTCAGGTAC GTGACGCAAG AGGAGCTGCG GCAGATGATG CAGCCGGACA 840 
ACGGGCTGCA ATGGTCGCCG TGGTTTCGCA TCATCGCCGC GCGCTTCCTT GAGCGTTGGT 900 
GGGCTGACCT GGACGCGGCC CTAAACACTG ACAAACACGA GGATTGGGGA ACGGTGCATC 960 
ACATCAACGA AGCGTGAAAG CAGAAGCTGC AGGATGTGAA GACACGTCAT GGGGTGGAAT 1020 
TGCGTACTTG GCAGCTTCGT ATCTCCTTTT TCTGAGACTG AACCTGCAGT CAGGTCCCAC 1080 
AAGGTCAGGT AAAATGGCTC GATAAAATGT ACCGTCACTT TTTGTCGCGT ATACTGAACT 1140 
CCAAGAGGTC AAAAAAAAAA AAAAA 1165 



<210> 12 
<211> 1135 
<212> DNA 

<213> Haematococcus pluvialis 




<400> 12 

CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGCAGTCC GATGCCGCGA 60 
TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCGCG CGTGAACTCC GCCCAGCAGC 120 
CCAGCTGTGC ACACGCGCGA CTCCAGTTTA AGCTCAGGAG CATGCAGCTG CTTTCCGAGG 180 
ACCGCACAGA CCACATGAGG GGTGCAAGCA CCTGGGCAGG CGGGCAGTCG CAGGATGAGC 240 
TGATGCTGAA GGACGAGTGC ATCTTGGTAG ATGTTGAGGA CAACATCACA GGCCATGCCA 300 
GCAAGCTGGA GTGTCACAAG TTCCTACCAC ATCAGCCTGC AGGCCTGCTG CACCGGGCCT 360 
TCTCTGTGTT CCTGTTTGAC GATCAGGGGC GACTGCTGCT GCAACAGCGT GCACGCTCAA 420 
AAATCACCTT CCCAAGTGTG TGGACGAACA CCTGCTGCAG CCACCCTTTA CATGGGCAGA 480 
CCCCAGATGA GGTGGACCAA CTAAGCCAGG TGGCCGACGG AACAGTACCT GGCGCAAAGG 540 
CTGCTGCCAT CCGCAAGTTG GAGCACGAGC TGGGGATACC AGCGCACCAG CTGCCGGCAA 600 
GCGCGTTTCG CTTCCTCACG CGTTTGCACT ACTGTGCCGC GGACGTGCAG CCAGCTGCGA 660 
CACAATCAGC GCTCTGGGGC GAGCACGAAA TGGACTACAT CTTGTTCATC CGGGCCAACG 720 
TCACCTTGGC GCCCAACCCT GACGAGGTGG ACGAAGTCAG GTACGTGACG CAAGAGGAGC 780 
TGCGGCAGAT GATGCAGCCG GACAACGGGC TTCAATGGTC GCCGTGGTTT CGCATCATCG 840 
CCGCGCGCTT CCTTGAGCGT TGGTGGGCTG ACCTGGACGC GGCCCTAAAC ACTGACAAAC 900 
ACGAGGATTG GGGAACGGTG CATCACATCA ACGAAGCGTG AAGGCAGAAG CTGCAGGATG 960 
TGAAGACACG TCATGGGGTG GAATTGCGTA CTTGGCAGCT TCGTATCTCC TTTTTCTGAG 1020 
ACTGAACCTG CAGAGCTAGA GTCAATGGTG CATCATATTC ATCGTCTCTC [illegible] 1080 
GACTAATCTG TAGCTAGAGT CACTGATGAA TCCTTTACAA CTTTCAAAAA AAAAA 1135 

<210> 13 
<211> 960 
<212> DNA 

<213> Tagetes erecta 
<400> 13 

CCAAAAACAA CTCAAATCTC CTCCGTCGCT CTTACTCCGC CATGGGTGAC GACTCCGGCA 60 
TGGATGCTGT TCAGCGACGT CTCATGTTTG ACGATGAATG CATTTTGGTG GATGAGTGTG 120 
ACAATGTGGT GGGACATGAT ACCAAATACA ATTGTCACTT GATGGAGAAG ATTGAAACAG 180 
GTAAAATGCT GCACAGAGCA TTCAGCGTTT TTCTATTCAA TTCAAAATAC GAGTTACTTC 240 
TTCAGCAACG GTCTGCAACC AAGGTGACAT TTCCTTTAGT ATGGACCAAC ACCTGTTGCA 300 
GCCATCCACT CTACAGAGAA TCCGAGCTTG TTCCCGAAAC GCCTGAGAGA ATGCTGCACA 360 
GAGGANNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 420 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 480 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 540 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 600 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 660 
NNNNNNNNNN NNNNNNNNNN TCATGTGCAA AAGGGTACAC TCACTGAATG CAATTTGATA 720 
TGAAAACCAT ACACAAGCTG ATATAGAAAC ACACCCTCAA CCGAAAAGCA AGCCTAATAA 780 
TTCGGGTTGG GTCGGGTCTA CCATCAATTG TTTTTTTCTT TTAACAACTT TTAATCTCTA 840 
TTTGAGCATG TTGATTCTTG TCTTTTGTGT GTAAGATTTT GGGTTTCGTT TCAGTTGTAA 900 
TAATGAACCA TTGATGGTTT GCAATTTCAA GTTCCTATCG ACATGTAGTG ATCTAAAAAA 960 

<210> 14 
<211> 305 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 14 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn 
1 5 10 15 

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu 
20 25 30 

Arg Ser Met Gln Met Thr Leu Met Gln Pro Ser Ile Ser Ala Asn Leu 
35 40 45 

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 
50 55 60 

Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys Asp Glu Cys Ile 
65 70 75 80 

Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala Ser Lys Leu Glu 
85 90 95 

Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu Leu His Arg Ala 
100 105 110 

Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu Leu Leu Gln Gln 
115 120 125 

Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp Thr Asn Thr Cys 
130 135 140 

Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu Val Asp Gln Leu 
145 150 155 160 

Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala Ile 
165 170 175 

Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His Gln Leu Pro Ala 
180 185 190 

Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val 
195 200 205 

Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu His Glu Met Asp 
210 215 220 

Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp 
225 230 235 240 

Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu Leu Arg Gln Met 
245 250 255 

Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp Phe Arg Ile Ile 
260 265 270 

Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu 
275 280 285 

Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His Ile Asn Glu 
290 295 300 

Ala 
305 

<210> 15 
<211> 293 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 15 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly
35 40 45

Ala Ser Thr Trp Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys
50 55 60

Asp Glu Cys Ile Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala
65 70 75 80

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu
85 90 95

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu
100 105 110

Leu Leu Gln Gln Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp
115 120 125

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu
130 135 140

Val Asp Gln Leu Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys
145 150 155 160

Ala Ala Ala Ile Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His
165 170 175

Gln Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys
180 185 190

Ala Ala Asp Val Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu
195 200 205

His Glu Met Asp Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala
210 215 220

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu
225 230 235 240

Leu Arg Gln Met Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp
245 250 255

Phe Arg Ile Ile Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu
260 265 270

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His
275 280 285

His Ile Asn Glu Ala
290

<210> 16 
<211> 284 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 16 

Met Ser Val Ser Ser Leu Phe Asn Leu Pro Leu Ile Arg Leu Arg Ser
1 5 10 15

Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro
20 25 30

Leu Ser Ser Ile Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser
35 40 45

Gly Thr Ala Met Thr Asp Thr Lys Asp Ala Gly Met Asp Ala Val Gln
50 55 60

Arg Arg Leu Met Phe Glu Asp Glu Cys Ile Leu Val Asp Glu Thr Asp
65 70 75 80

Arg Val Val Gly His Val Ser Lys Tyr Asn Cys His Leu Met Glu Asn
85 90 95

Ile Glu Ala Lys Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe
100 105 110

Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Asn Thr Lys Val
115 120 125

Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr
130 135 140

Arg Glu Ser Glu Leu Ile Gln Asp Asn Ala Leu Gly Val Arg Asn Ala
145 150 155 160

Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Val Ala Glu Asp Val
165 170 175

Pro Val Asp Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro
180 185 190

Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile
195 200 205

Val Arg Asp Val Lys Val Gln Pro Asn Pro Asp Glu Val Ala Glu Ile
210 215 220

Lys Tyr Val Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp
225 230 235 240

Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val
245 250 255

Asp Asn Phe Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Leu
260 265 270

Val Glu Ala Ile Asp Met Lys Thr Ile His Lys Leu
275 280

<210> 17 
<211> 287 
<212> PRT 

<213> Clarkia breweri 
<400> 17 

Met Ser Ser Ser Met Leu Asn Phe Thr Ala Ser Arg Ile Val Ser Leu
1 5 10 15

Pro Leu Leu Ser Ser Pro Pro Ser Arg Val His Leu Pro Leu Cys Phe
20 25 30

Phe Ser Pro Ile Ser Leu Thr Gln Arg Phe Ser Ala Lys Leu Thr Phe
35 40 45

Ser Ser Gln Ala Thr Thr Met Gly Glu Val Val Asp Ala Gly Met Asp
50 55 60

Ala Val Gln Arg Arg Leu Met Phe Glu Asp Glu Cys Ile Leu Val Asp
65 70 75 80

Glu Asn Asp Lys Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu
85 90 95

Met Glu Lys Ile Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val
100 105 110

Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala
115 120 125

Thr Lys Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His
130 135 140

Pro Leu Tyr Arg Glu Ser Glu Leu Ile Asp Glu Asn Cys Leu Gly Val
145 150 155 160

Arg Asn Ala Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala
165 170 175

Glu Asp Leu Pro Val Asp Gln Phe Ile Pro Leu Ser Arg Ile Leu Tyr
180 185 190

Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu
195 200 205

Leu Phe Ile Ile Arg Asp Val Asn Leu Asp Pro Asn Pro Asp Glu Val
210 215 220

Ala Glu Val Lys Tyr Met Asn Arg Asp Asp Leu Lys Glu Leu Leu Arg
225 230 235 240

Lys Ala Asp Ala Glu Glu Glu Gly Val Lys Leu Ser Pro Trp Phe Arg
245 250 255

Leu Val Val Asp Asn Phe Leu Phe Lys Trp Trp Asp His Val Glu Lys
260 265 270

Gly Ser Leu Lys Asp Ala Ala Asp Met Lys Thr Ile His Lys Leu
275 280 285

<210> 18 
<211> 261 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Thr Gly Pro Pro Pro Arg Phe Phe Pro Ile Arg Ser Pro Val Pro Arg
1 5 10 15

Thr Gln Leu Phe Val Arg Ala Phe Ser Ala Val Thr Met Thr Asp Ser
20 25 30

Asn Asp Ala Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe Glu Asp
35 40 45

Glu Cys Ile Leu Val Asp Glu Asn Asn Arg Val Val Gly His Asp Thr
50 55 60

Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala Glu Asn Leu Leu
65 70 75 80

His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu
85 90 95

Leu Gln Gln Arg Ser Lys Thr Lys Val Thr Phe Pro Leu Val Trp Thr
100 105 110

Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu Ile Glu
115 120 125

Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu Phe Asp
130 135 140

Glu Leu Gly Ile Val Ala Glu Asp Val Pro Val Asp Glu Phe Thr Pro
145 150 155 160

Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu
165 170 175

His Glu Val Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Lys Leu Gln
180 185 190

Pro Asn Pro Asp Glu Val Ala Glu Ile Lys Tyr Val Ser Arg Glu Glu
195 200 205

Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp Glu Ala Val Lys
210 215 220

Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Met Lys Trp
225 230 235 240

Trp Asp His Val Glu Lys Gly Thr Ile Thr Glu Ala Ala Asp Met Lys
245 250 255

Thr Ile His Lys Leu
260

<210> 19 
<211> 288 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 19 

Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser Tyr
1 5 10 15

Ala Lys Leu Val Gln Asn Gln Thr Pro Glu Asp Ile Leu Glu Glu Phe
20 25 30

Pro Glu Ile Ile Pro Leu Gln Gln Arg Pro Asn Thr Arg Ser Ser Glu
35 40 45

Thr Ser Asn Asp Glu Ser Gly Glu Thr Cys Phe Ser Gly His Asp Glu
50 55 60

Glu Gln Ile Lys Leu Met Asn Glu Asn Cys Ile Val Leu Asp Trp Asp
65 70 75 80

Asp Asn Ala Ile Gly Ala Gly Thr Lys Lys Val Cys His Leu Met Glu
85 90 95

Asn Ile Glu Lys Gly Leu Leu His Arg Ala Phe Ser Val Phe Ile Phe
100 105 110

Asn Glu Gln Gly Glu Leu Leu Leu Gln Gln Arg Ala Thr Glu Lys Ile
115 120 125

Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu Cys
130 135 140

Ile Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys Ile Lys
145 150 155 160

Gly Ala Ile Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu Gly Ile
165 170 175

Pro Glu Asp Glu Thr Lys Thr Arg Gly Lys Phe His Phe Leu Asn Arg
180 185 190

Ile His Tyr Met Ala Pro Ser Asn Glu Pro Trp Gly Glu His Glu Ile
195 200 205

Asp Tyr Ile Leu Phe Tyr Lys Ile Asn Ala Lys Glu Asn Leu Thr Val
210 215 220

Asn Pro Asn Val Asn Glu Val Arg Asp Phe Lys Trp Val Ser Pro Asn
225 230 235 240

Asp Leu Lys Thr Met Phe Ala Asp Pro Ser Tyr Lys Phe Thr Pro Trp
245 250 255

Phe Lys Ile Ile Cys Glu Asn Tyr Leu Phe Asn Trp Trp Glu Gln Leu
260 265 270

Asp Asp Leu Ser Glu Val Glu Asn Asp Arg Gln Ile His Arg Met Leu
275 280 285

<210> 20 
<211> 456 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence of four plant B-cyclases 

<400> 20 

Met Asp Thr Leu Leu Lys Thr Pro Asn Leu Glu Phe Leu Pro His Gly
1 5 10 15

Phe Val Lys Ser Phe Ser Lys Phe Gly Lys Cys Glu Gly Val Cys Val
20 25 30

Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys Glu Asn
35 40 45

Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val Val Asp
50 55 60

Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val Ala Gln Gln
65 70 75 80

Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp Pro Pro Lys Leu
85 90 95

Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met
100 105 110

Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly Ala Val Tyr Ile
115 120 125

Asp Asp Thr Lys Asp Leu Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln
130 135 140

Leu Lys Ser Lys Met Met Gln Lys Cys Ile Asn Gly Val Lys Phe His
145 150 155 160

Gln Ala Lys Val Ile Lys Val Ile His Glu Glu Lys Ser Met Leu Ile
165 170 175

Cys Asn Asp Gly Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly
180 185 190

Phe Ser Arg Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln



195 200 205

Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Lys
210 215 220

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Asn Glu Leu Lys
225 230 235 240

Glu Arg Asn Ser Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser
245 250 255

Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly Leu
260 265 270

Arg Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu His Leu Gly
275 280 285

Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys Val Ile Pro Met
290 295 300

Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly Ile Gly Gly
305 310 315 320

Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr
325 330 335

Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile Tyr Leu Gly Ser
340 345 350

Glu Ser Ser Gly Glu Leu Ser Ala Glu Val Trp Lys Asp Leu Trp Pro
355 360 365

Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe Cys Phe Gly Met Asp Ile
370 375 380

Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg Arg Phe Phe Asp Ala Phe
385 390 395 400

Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu
405 410 415

Phe Leu Pro Glu Leu Ile Val Phe Gly Leu Ser Leu Phe Ser His Ala
420 425 430

Ser Asn Thr Ser Arg Glu Ile Met Thr Lys Gly Thr Pro Leu Val Met
435 440 445

Ile Asn Asn Leu Leu Gln Asp Glu
450 455

<210> 21 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 21 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
1 5 10 15

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr
20 25 30

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly
35 40 45

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe
50 55 60

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe
65 70 75 80

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val
85 90 95

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val
100 105 110

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala
115 120 125

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr
130 135 140

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln
145 150 155 160

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp
165 170 175

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg
180 185 190

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser
195 200 205

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu
210 215 220

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala
225 230 235 240

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val
245 250 255

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu
260 265 270

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr
275 280 285

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro
290 295 300

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu
305 310 315 320

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys
325 330 335

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys
340 345 350

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro
355 360 365

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val
370 375 380

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro
385 390 395 400

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys
405 410 415

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro
420 425 430

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu
435 440 445

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe
450 455 460

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu
465 470 475 480

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser
485 490 495

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro
500 505 510

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val
515 520

<210> 22 
<211> 1898 
<212> DNA 

<213> Adonis palaestina 

<400> 22

AAAGGAGTGT TCTATTAATG TTACTGTCGC ATTCTTGCAA CACTTATATT CAAACTCCAT 60

TTTCTTCTTT TCTCTTCAAA ACAACAAACT AATGTGAGCA GAGTATCTGG CTATGGAACT 120

ACTTGGTGTT CGCAACCTCA TCTCTTCTTG CCCTGTGTGG ACTTTTGGAA CAAGAAACCT 180

TAGTAGTTCA AAACTAGCTT ATAACATACA TCGATATGGT TCTTCTTGTA GAGTAGATTT 240

TCAAGTGAGA GCTGATGGTG GAAGCGGGAG TAGAAGTTCT GTTGCTTATA AAGAGGGTTT 300

TGTGGATGAA GAGGATTTTA TCAAAGCTGG TGGTTCTGAG CTTTTGTTTG TCCAAATGCA 360

GCAAACAAAG TCTATGGAGA AACAGGCCAA GCTCGCCGAT AAGTTGCCAC CAATACCTTT 420

TGGAGAATCC GTGATGGACT TGGTTGTAAT AGGTTGTGGA CCTGCTGGTC TTTCACTGGC 480

TGCAGAAGCT GCTAAGCTAG GGTTGAAAGT TGGCCTTATT GGTCCTGATC TTCCTTTTAC 540

AAATAATTAT GGTGTGTGGG AAGACGAGTT CAAAGATCTT GGACTTGAAC GTTGTATCGA 600

GCATGCTTGG AAGGACACCA TCGTATATCT TGATAATGAT GCTCCTGTCC TTATTGGTCG 660

TGCATATGGA CGAGTTAGTC GACATTTGCT ACATGAGGAG TTGCTGAAAA GGTGTGTGGA 720

GTCAGGTGTA TCATATCTTG ATTCTAAAGT GGAAAGGATC ACTGAAGCTG GTGATGGCCA 780

TAGCCTTGTA GTTTGTGAAA ATGAGATCTT TATCCCTTGC AGGCTTGCTA CTGTTGCATC 840

TGGAGCAGCT TCAGGGAAAC TTTTGGAGTA TGAAGTAGGT GGCCCTCGTG TTTGTGTCCA 900

AACCGCTTAT GGGGTGGAGG TTGAGGTGGA GAACAATCCA TACGATCCCA ACTTAATGGT 960

ATTCATGGAC TACAGAGACT ATATGCAACA GAAATTACAG TGCTCGGAAG AAGAATATCC 1020

AACATTTCTC TATGTCATGC CCATGTCGCC AACAAGACTT TTTTTTGAGG AAACCTGTTT 1080

GGCCTCAAAA GATGCCATGC CATTCGATCT ACTGAAGAGA AAACTGATGT CACGATTGAA 1140

GACTCTGGGT ATCCAAGTTA CAAAAGTTTA TGAAGAGGAA TGGTCATATA TTCCTGTTGG 1200

TGGTTCTTTA CCAAACACAG AGCAAAAGAA CCTAGCATTT GGTGCTGCAG CAAGCATGGT 1260

GCATCCAGCA ACAGGCTATT CGGTTGTACG GTCACTGTCA GAAGCTCCAA AATATGCTTC 1320

TGTAATTGCA AAGATTTTGA AGCAAGATAA CTCTGCGTAT GTGGTTTCTG GACAAAGTAG 1380

TGCAGTAAAC ATTTCAATGC AAGCATGGAG CAGTCTTTGG CCAAAGGAGC GAAAACGTCA 1440

AAGAGCATTC TTTCTTTTTG GATTAGAGCT TATTGTGCAG CTAGATATTG AAGCAACCAG 1500

AACATTCTTT AGAACCTTCT TCCGCTTGCC AACTTGGATG TGGTGGGGTT TCCTTGGGTC 1560

TTCACTATCA TCTTTCGATC TCGTCTTGTT TTCCATGTAC ATGTTTGTTT TGGCGCCAAA 1620

CAGCATGAGG ATGTCACTTG TGAGACATTT GCTTTCAGAT CCTTCTGGTG CAGTTATGGT 1680

AAGAGCTTAC CTCGAAAGGT AGTCTCATCT ATTATTAAAC TCTAGTGTTT CACCAAATAA 1740

ATGAGGATCC TTCGAATGTG TATATGATCA TCTCTATGTA TATCCTGTAC TCTAATCTCA 1800

TAAAGTAAAT GCCGGGTTTG ATATTGTTGT GTCAAACCGG CCAATGATAT AAAGTAAATT 1860

TATTGATACA AAAGTAGTTT TTTTCCTTAA AAAAAAAA 1898

<210> 23 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 23 

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80



Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu 
515 520 525 

Arg 

<210> 24 
<211> 1370 
<212> DNA 
<213> Potato 

<400> 24 

TAGCGGAGGA TGAGTTCAAA GATCTTGGTC TTCAAGCCTG CATTGAACAT GTTTGGCTGG 60 

GATACCATTG TATATCTTGA TGATGATGAT CCTATTCTTA TTGGCCGTGC CTATGGAAGA 120

GTTAGTCGCC ATTTACTGCA CGAGGAGTTA CTCAAAAGGT GTGTGGAGGC AGGTGTTTTG 180 

TATCTAAACT CGAAAGTGGA TAGGATTGTT GAGGCCACAA ATGGCCACAG TCTTGTAGAG 240

TGCGAGGGTG ATGTTGTGAT TCCCTGCAGG TTTGTGACTG TTGCATCGGG AGCAGCCTCG 300 

GGGAAATTCT TGCAGTATGA GTTGGGAGGT CCTAGAGTTT CTGTTCAAAC AGCTTATGGA 360 

GTGGAAGTTG AGGTCGATAA CAATCCATTT GACCCGAGCC TGATGGTTTT CATGGATTAT 420 

AGAGACTATG TCAGACACGA CGCTCAATCT TTAGAAGCTA AATATCCAAC ATTTCTCTAT 480

GCCATGCCCA TGTCTCCAAC ACGAGTCTTT TTCGAGGAAA CTTGTTTGGC TTCAAAAGAT 540

GCAATGCCAT TCGATCTGTT AAAGAAAAAA TTGATGTTAC GATTGAACAC CCTCGGTGTA 600 

AGAATTAAAG AAATTTATGA GGAGGAATGG TCTTACATAC CAGTTGGAGG ATCTTTGCCA 660 

AATACAGAAC AAAAAACACT TGCATTTGGT GCTGCTGCTA GCATGGTTCA TCCAGCCACA 720 

GGTTATTCAG TCGTCAGATC ACTGTCTGAA GCTCCAAAAT GCGCCTTCGT GCTTGCAAAT 780 

ATATTACGAC AAAATCATAG CAAGAATATG CTTACTAGTT CAAGTACCCC GAGTATTTCA 840 

ACTCAAGCTT GGAACACTCT TTGGCCACAA GAACGAAAAC GACAAAGATC GTTTTTCCTA 900

TTTGGACTGG CTCTGATATT GCAGCTGGAT ATTGAGGGGA TAAGGTCATT TTTCCGCGCG 960

TTCTTCCGTG TGCCAAAATG GATGTGGCAG GGATTTCTTG GTTCAAGTCT TTCTTAGCAG 1020 

ACCTCATGTT ATTTGCCTTC TACATGTTTA TTATTGCACC AAATGACATG AGAAGAGGCT 1080

TAATCAGACA TCTTTTATCT GATCCTACTG GTGCAACATT GATAAGAACT TATCTTACAT 1140

TTTAGAGTAA ATTCCTCCTA CAATAGTTGT TGAAAGAGGC CTCATTACTT CAGATTCATA 1200 

ACAGAAATCG CGGTCTCTCG AGGCCTTGTA TATAACATTT TCACTAGGTT AATATTGCTT 1260

GAATAAGTTG CACAGTTTCA GTTTTTGTAT CTGCTTCTTT TTTGTCCAAG ATCATGTATT 1320 

GACCAATTTA TATACATTGC CAGTATATAT AAATTTTATA AAAAAAAAAA 1370

<210> 25 

<211> 377 

<212> PRT 

<213> Potato 

<400> 25 

Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile Glu His Val Trp
1 5 10 15

Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Asp Pro Ile Leu Ile Gly
20 25 30

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu
35 40 45

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp
50 55 60

Arg Ile Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly
65 70 75 80

Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala
85 90 95

Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Gly Pro Arg Val Ser Val
100 105 110

Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp
115 120 125

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp
130 135 140

Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro
145 150 155 160

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys
165 170 175

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu
180 185 190

Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser
195 200 205

Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu
210 215 220

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser
225 230 235 240

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala
245 250 255

Asn Ile Leu Arg Gln Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser
260 265 270

Thr Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu
275 280 285

Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu
290 295 300

Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg
305 310 315 320

Val Pro Lys Met Met Trp Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala
325 330 335

Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn Asp
340 345 350

Met Arg Arg Gly Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly Ala
355 360 365

Thr Leu Ile Arg Thr Tyr Leu Thr Phe
370 375

<210> 26 
<211> 533 
<212> PRT 

<213> Chimeric lettuce/potato 
<400> 26 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe
1 5 10 15

Thr Cys Pro Arg Phe Thr Asp Cys Asn Ile Arg His Lys Phe Ser Leu
20 25 30

Leu Lys Gly Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg
35 40 45

Gln Ile Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gln
50 55 60

Gly Ile Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser
65 70 75 80

Glu Leu Phe Phe Val Gln Met Gln Arg Thr Lys Ser Met Glu Ser Gln
85 90 95

Ser Lys Leu Ser Glu Lys Leu Ala Gln Ile Pro Ile Gly Asn Cys Ile
100 105 110

Leu Asp Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala
115 120 125

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu Ile Gly Pro Asp
130 135 140

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gln Asp Glu Phe Ile Gly
145 150 155 160

Leu Gly Leu Glu Gly Cys Ile Glu His Ser Trp Lys Asp Thr Leu Val
165 170 175

Tyr Leu Asp Asp Ala Asp Pro Ile Arg Ile Gly Arg Ala Tyr Gly Arg
180 185 190

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu
195 200 205

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala
210 215 220

Pro Asn Gly Tyr Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro
225 230 235 240

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu
245 250 255

Glu Tyr Glu Leu Gly Gly Pro Arg Val Ser Val Gln Thr Ala Tyr Gly
260 265 270

Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp Pro Ser Leu Met Val
275 280 285

Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp Ala Gln Ser Leu Glu
290 295 300

Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro Met Ser Pro Thr Arg
305 310 315 320

Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe
325 330 335

Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu Asn Thr Leu Gly Val
340 345 350

Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly
355 360 365

Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu Ala Phe Gly Ala Ala
370 375 380

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu
385 390 395 400

Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala Asn Ile Leu Arg Gln
405 410 415

Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser Thr Pro Ser Ile Ser
420 425 430

Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu Arg Lys Arg Gln Arg
435 440 445

Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu Gln Leu Asp Ile Glu
450 455 460

Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg Val Pro Lys Trp Met
465 470 475 480

Trp Gln Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala Asp Leu Met Leu
485 490 495

Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn Asp Met Arg Arg Gly
500 505 510

Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Leu Ile Arg
515 520 525

Thr Tyr Leu Thr Phe
530

<210> 27 
<211> 374 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 27 

Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln Lys Cys Ile Glu His Val
1 5 10 15

Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp Asp Lys Pro Ile Thr Ile
20 25 30

Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His Glu Glu Leu
35 40 45

Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser Ser Lys Val
50 55 60

Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu Arg Leu Val Ala Cys Asp
65 70 75 80

Asp Asn Asn Val Ile Pro Cys Arg Leu Ala Thr Val Ala Ser Gly Ala
85 90 95

Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val Gly Gly Pro Arg Val Cys
100 105 110

Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn Ser Pro Tyr
115 120 125

Asp Pro Asp Gln Met Val Phe Met Asp Tyr Arg Asp Tyr Thr Asn Glu
130 135 140

Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu Tyr Ala Met
145 150 155 160

Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys Leu Ala Ser
165 170 175

Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu Met Leu Arg
180 185 190

Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys Thr Tyr Glu Glu Glu Trp
195 200 205

Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Asn
210 215 220

Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr
225 230 235 240

Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala Ser Val Ile
245 250 255

Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys Gln Ile Asn Ser Asn Ile
260 265 270

Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg Lys Arg Gln
275 280 285

Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln Phe Asp Thr
290 295 300

Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp
305 310 315 320

Met Trp Gln Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly Asp Leu Val
325 330 335

Leu Phe Ala Leu Tyr Met Phe Val Ile Ser Pro Asn Asn Leu Arg Lys
340 345 350

Gly Leu Ile Asn His Leu Ile Ser Asp Pro Thr Gly Ala Thr Met Ile
355 360 365

Lys Thr Tyr Leu Lys Val
370

<210> 28
<211> 1002
<212> DNA
<213> Adonis palaestina

<400> 28

ATTCATCTTC AGCAGCGCTG TCGTACTCTT TCTATATCTT CTTCCATCAC TAACAGTAGT 60

CGCCGACGGT TGAATCGGCT ATTCGCCTCA ACGTCAACTA TGGGTGAAGT CACTGATGCT 120

GGAATGGATG CTGTTCAGAA GCGGCTCATG TTCGACGACG AATGTATTTT GGTGGATGAG 180

AATGACAAGG TCGTCGGGCA TGATTCCAAA TACAACTGTC ATTTGATGGA AAAGATAGAG 240

GCAGAAAATT TGCTTCACAG AGCCTTCAGT GTTTTCTTGT TCAACTCAAA ATATGAATTG 300

CTTCTTCAGC AACGATCCGC CACAAAGGTA ACATTCCCGC TCGTATGGAC AAACACATGT 360

TGCAGTCATC CTCTCTTTCG TGATTCCGAG CTCATAGAAG AAAATTATCT CGGTGTACGA 420

AACGCTGCAC AAAGAAAGCT TTTAGACGAG CTAGGCATTC CAGCTGAAGA TGTCCCAGTT 480

GATGAATTTA CTCCTCTTGG TCGCATTCTT TACAAAGCTC CATCTGACGG CAAATGGGGA 540

GAGCACGAAT TGGACTATCT CCTATTTATT GTCCGAGATG TGAAATACGA TCCAAACCCA 600

GATGAAGTTG CTGATGCTAA GTATGTTAAT CGCGAGGAGT TGAGAGAGAT ACTGAGAAAA 660

GCTGATGCTG GTGAAGAGGG ACTCAAGTTG TCTCCTTGGT TTAGATTGGT TGTTGATAAC 720

TTTTTGTTCA AGTGGTGGGA TCATGTAGAG CAGGGTACGA TTAAGGAAGT TGCTGACATG 780

AAAACTATCC ACAAGTTGAC TTAAGAGGAC TTCTCTCCTC TGTTCTACTA TTTGTTTTTT 840

GCTACAATAA GTGGGTGGTG ATAAGCAGTT TTTCTGTTTT CTTTAATTTA TGGCTTTTGA 900

ATTTGCCTCG ATGTTGAACT TGTAACATAT TTAGACAAAT ATGAGACCTT GTAAGTTGAA 960

TTTGAGGCTG AATTTATATT TTTGGGAACA TAATAATGTT AA 1002

<210> 29
<211> 1271
<212> DNA

<213> Adonis palaestina
<400> 29

TTTTAAAGCT CTTTCGCTCC ACCACCATCA AAGCCAGCCA AATTTCTCTG TACAAAAGTT 60

AAAAACACCG CTTTGGGCTT TGGCCCCTCC ATATCGGAAT CCTTGTTTAC GATACGCATC 120

TAAACCAGTA ATTCTCGGTT TTAATTTGTT TCCTAAATTA GGCCCCTTTC CGGAATCCCG 180

AGAATTATGT CGTCGATCAG GATTAATCCT TTATATAGTA TCTTCTCCAC CACCACTAAA 240

ACATTATCAG CTTCGTGTTC TTCTCCCGCT GTTCATCTTC AGCAGCGTTG TACGTACTCT 300

TTCTATTTCT TCTTCCATCA CTAACAGTCC TCGCCGAGGG TTGAATCGGC TGTTCGCCTC 360

AACGTCGACT ATGGGTGAAG TCGCTGATGC TGGTATGGAT GCCGTCCAGA AGCGGCTTAT 420

GTTCGACGAT GAATGTATTT TGGTGGATGA GAATGACAAG GTCGTCGGAC ATGATTCCAA 480

ATACAACTGT CATTTGATGG AAAAGATAGA GGCAGAAAAC TTGCTTCACA GAGCCTTCAG 540

TGTTTTCTTA TTCAACTCAA AATACGAGTT GCTTCTTCAG CAACGATCTG CAACGAAGGT 600

AACATTCCCG CTCGTATGGA CAAACACCTG TTGCAGCCAT CCCCTCTTCC GTGATTCCGA 660

ACTCATAGAA GAAAATTTTC TCGGGGTACG AAACGCTGCA CAAAGGAAGC TTTTAGACGA 720

GCTAGGCATT CCAGCTGAAG ACGTACCAGT TGATGAATTC ACTCCTCTTG GTCGCATTCT 780

TTACAAAGCT CCATCTGACG GAAAATGGGG AGAGCACGAA CTGGACTATC TTCTGTTTAT 840

TGTCCGAGAT GTGAAATACG ATCCAAACCC AGATGAAGTT GCTGACGCTA AGTACGTTAA 900

TCGCGAGGAG TTGAAAGAGA TACTGAGAAA AGCTGATGCA GGTGAAGAGG GAATAAAGTT 960

GTCTCCTTGG TTTAGATTGG TTGTGGATAA CTTTTTGTTC AAGTGGTGGG ATCATGTAGA 1020

GGAGGGGAAG ATTAAGGACG TCGCCGACAT GAAAACTATC CACAAGTTGA CTTAAGAGAA 1080

AGTCTCTTAA GTTCTACTAT TTGGTTTTTG CTTCAATAAG TGGATGGTGA TGAGCAGTTT 1140

TTATGCTTCC TTTAATTTTG GCTTTTCAAT TTGCTTTATG TGTTGAACTT GTAACATATT 1200

TAGTCAAATA TGAGACCTTG TGAGTTGAAT TTGAGGTTAT ATTTATAGTT TTGGGAACAT 1260

AAAAAAAAAA A 1271



<210> 30 
<211> 1109 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 30 

TGGAACCTGG CCCGGCGGCA GTCCGATGCC GCGATGCTTC GTTCGTTGCT CAGAGGCCTC 60 

ACGCATATCC CGCGCGTGAA CTCCGCCCAG CAGCCCAGCT GTGCACACGC GCGACTCCAG 120

TTTAAGCTCA GGAGCATGCA GCTGCTTGCC GAGGACCGCA CAGACCACAT GAGGGGTGCA 180 

AGCACCTGGG CAGGCGGGCA GTCGCAGGAT GAGCTGATGC TGAAGGACGA GTGCATCTTA 240

GTGGATGCTG ACGACAACAT CACAGGCCAT GCCAGCAAGC TGGAGTGCCA CAAATTCCTA 300

CCACATCAGC CTGCAGGCCT GCTGCACCGG GCCTTCTCTG TGTTCCTGTT TGACGACCAG 360 

GGGCGACTGC TGCTGCAACA GCGTGCACGC TCAAAAATCA CCTTCCCAAG TGTGTGGACG 420 

AACACCTGCT GCAGCCACCC TCTACATGGG CAGACCCCAG ATGAGGTGGA CCAACTAAGC 480

CAGGTGGCCG ACGGCACAGT ACCTGGCGCA AAAGCTGCTG CCATCCGCAA GTTGGAGCAC 540

GAGCTGGGGA TACCAGCGCA CCAGCTGCCG GCAAGCGCGT TTCGCTTCCT CACGCGTTTG 600 

CACTACTGTG CCGCGGACGT GCAGCCGGCT GCGACACAAT CAGCGCTCTG GGGCGAGCAC 660 

GAGATGGACT ACATCTTATT CATCCGGGCC AACGTCACCT TGGCGCCCAA CCCTGACGAG 720 

GTGGACGAAG TCAGGTACGT GACGCAAGAG GAGCTGCGGC AGATGATGCA GCCGGACAAC 780

GGGTTGCAAT GGTCGCCGTG GTTTCGCATC ATCGCCGCGC GCTTCCTTGA GCGTTGGTGG 840

GCTGACCTGG ACGCGGCCCT AAACACTGAC AAACACGAGG ATTGGGGAAC GGTGCATCAC 900 

ATCAACGAAG CGTGAAGGCA GAAGCTGCAG GATGTGAAGA CACGTCATGG GGTGGAATTG 960 

CGTACTTGGC AGCTTCGTAT CTCCTTTTTC TGAGACTGAA CCTGCAGAGC TAGAGTCAAT 1020 

GGTGCATCAT ATTCATCGTC TCTCTTTTGT TTTAGACTAA TCTGTAGCTA GAGTCACTGA 1080 

TGAATCCTTT ACAACTTTCA AAAAAAAAA 1109 

<210> 31 
<211> 985 
<212> DNA 

<213> Lactuca sativa 
<400> 31 

TGCCAAAATG TTGAAATTTC CCCCTTTTAA AACCATTGCT ACCATGATCT CTTCTCCATA 60

TTCTTCCTTC TTGCTGCCTC GGAAATCTTC TTTCCCTCCA ATGCCGTCTC TCGCAGCCGC 120 

TAGTGTTTTC CTCCACCCTC TTTCGTCTGC CGCTATGGGC GATTCCAGCA TGGATGCTGT 180

CCAGCGACGT CTCATGTTCG ATGACGAATG CATTTTGGTG GATGAGAATG ACAAAGTGGT 240

TGGCCATGAT ACTAAATACA ATTGTCATTT GATGGAGAAG ATTGAAAAGG GAAATATGCT 300

ACACAGAGCA TTCAGTGTGT TCTTGTTCAA CTCGAAATAT GAATTACTCC TTCAGCAACG 360

TTCTGCAACC AAGGTGACTT TCCCTTTGGT ATGGACAAAC ACGTGTTGCA GCCATCCACT 420

ATACAGGGAG AGTGAGCTTA TTGACGAAAA CGCCCTTGGG GTGAGGAATG CTGCACAGAG 480

GAAGCTCCTG GATGAACTCG GCATCCCTGG AGCAGATGTT CCGGTTGATG AGTTCACTCC 540

ATTGGGTCGC ATTCTATACA AGGCCGCATC GGATGGAAAG TGGGGAGAAC ATGAACTTGA 600

TTACCTGCTG TTTATGGTAC GTGATGTTGG TTTGGATCCG AACCCAGATG AAGTGAAAGA 660

TGTAAAATAT GTGAACCGGG AAGAGCTGAA GGAATTGGTA AGGAAGGCGG ATGCTGGTGA 720

AGAGGGTGTG AAGCTGTCCC CGTGGTTCAA ATTGATTGTC GATAATTTCT TGTTTCAGTG 780

GTGGGATCGA CTCCATAAGG GAACCCTAAC CGAAGCTATT GATATGAAAA CAATCCACAA 840

ACTCACATAA AAACACTACA CTAGTAGGAG AGAGGATTAT ATGAGATATT TGTTATATGT 900

GAAATTGAAA TTCAGATGAA TGCTTGTATT TATTTCTATT TGGACAAACT TCAACTTCTT 960

TTTGCTACCT TATCAGAAAA AAAAA 985

<210> 32
<211> 988
<212> DNA

<213> Lactuca sativa
<400> 32

TATTCGCTTC AAAATCTCTT CCATTAACTG CTCAAATCTC CACCTTCGCC GGTCTTAATC 60

TCCGCCGGCG CACTTTCACC ACCATAACCG CCGCCATGGG TGACGATTCC GGCATGGACG 120

CTGTCCAGAG ACGTCTCATG TTTGATGATG AATGCATTTT GGTTGATGAA AATGACAATG 180

TTCTTGGGCA TGATACCAAA TACAATTGTC ACTTGATGGA GAAGATTGAG AAAGATAATT 240

TGCTTCATAG AGCATTCAGT GTATTTTTAT TCAATTCAAA ATACGAATTA CTCCTTCAGC 300

AAAGGTCAGA AACCAAGGTG ACATTTCCTT TGGTATGGAC AAACACCTGT TGCAGCCATC 360

CACTATACAG AGAATCGGAG TTAATTCCCG AAAATGCCCT TGGGGTCAGA AATGCTGCAC 420

AGAGGAAGCT TCTAGATGAA CTCGGTATCC CTGCTGAAGA TGTTCCAGTT GATGAGTTCA 480

CAACTTTAGG TCGCATGTTG TACAAGGCTC CATCTGATGG AAAATGGGGT GAACATGAAG 540

TTGATTACCT ACTCTTCCTC GTGCGTGACG TTGCCGTGAA CCCAAACCCT GATGAGGTGG 600

CGGACATTAG ATACGTGAAC CAAGAAGAGT TAAAAGAGTT ACTAAGGAAG GCGGATGCGG 660

GTGAGGAGGG TTTGAAATTG TCCCCATGGT TTAGGCTAGT GGTGGACAAC TTCTTGTTCA 720

AATGGTGGGA TCATGTCCAA AAGGGGACAC TCAATGAAGC AATTGACATG AAAACCATTC 780

ATAAGTTGAT ATGAAAAATG GTTAATATTT ATGGTGGTGG TTTGGAGCTA ATAATTTGTG 840

TGTTCAAGTC TCGGTCCTTC TTTTTTTAAC GTTTTTTTTT TTTCTTTTAT TGGGAGTGTT 900

TATTGTGTAC TTGTAACGTA GGCCCTTTGG TTACGCTTTA AGAGTTTAAT AAAGAACCAC 960

CGTTAATTTA AAAAAAAAAA AAAAAAAA 988

<210> 33 
<211> 1874 
<212> DNA 

<213> Chlamydomonas reinhardtii 
<400> 33 

GGCACGAGCT CGAGTTTGTT TTACCATGAC ATCGGGAATT TGGAAGCTTG AACTACCTCA 60 

ATTACTCAAG TAACTCGCGG CAACACATTT CGCGCGCCAT CGCTGTTTTC TCTGCTCCAG 120

CTACCGAGCA GCATTGCTTT AGATCGCTTT GATGTCATAA ACTCCCACTT ATATGAGATC 180 

CAGTTTCATC GAGCCCAAGC CCAGAGCGCA ACCTGTCTTA AGCCGCGGCA GGGCGTCCAT 240

GCGCCTCGCG CAAAGCCGTG CTCTCGTTGC GCGTGTCAGC TCCGCCCTGT GGCCGGGAGC 300 

AGGACTTTCA CAGGCTCAAA GCGTTGCGGT GCGAATGGCG AGTTCGTCAA CCTGGGAAGG 360 

CACGGGCCTG AGCCAGGATG ACTTCATGCA GCGGGACGAG TGCTTGGTGG TGGACGAGCA 420 

GGACCGGCTG CTAGGCACCG CCAACAAGTA CGACTGCCAC CGCTTCGAGG CGGCCAAGGG 480

CCAGCCCTGC GGCCGCCTGC ACCGCGCCTT CTCCGTGTTC CTGTTCAGCC CCGACGGCCG 540

ACTGCTGCTG CAGCAGCGCG CAGCCAGCAA GGTGACGTTC CCGGGTGTGT GGACCAACAC 600

CTGCTGCTCG CACCCGCTGG CGGGCCAGGC GCCGGACGAG GTGGACCTGC CGGCGGCGGT 660 

AGCCTCGGGC CAGGTGCCGG GCATCAAGGC GGCGGCGGTG CGCAAGCTGC AGCACGAGCT 720 

GGGGATACCG CCGGAGCAGG TTCCCGCCTC CTCCTTCTCC TTCCTCACGC GTCTGCACTA 780 

CTGCGCCGCC GACACCGCCA CGCACGGCCC GGCGGCGGAG TGGGGCGAGC ACGAGGTGGA 840

CTACGTGCTG TTCGTGCGGC CGCAGCAGCC CGTCAGCCTG CAGCCCAACC CAGACGAGGT 900 

GGACGCCACG CGCTACGTGA CGCTGCCGGA GCTTCAGTCC ATGATGGCGG ACCCCGGCCT 960 

CAGCTGGAGC CCCTGGTTCC GCATCCTGGC CACACAGCCC GCCTTCCTGC CCGCCTGGTG 1020 

GGGCGACCTG AAGCGGCGCT GGCGCCCGGG CGGCAGCCGA CTGTCGGACT GGGGCACCAT 1080 

CCACCGCGTC ATGTGAAGAA AAAGGGGAAG CAGGGGCGGG AGCGGGGGAT GAATGGGAAT 1140

GTGAATGCGA TTGTGATGCG GCGTGGGATG AGGTCTGAAG ACAGGGGGAA AATCGGGGGG 1200 

CGGGCGTGAG CGTGTGTGTA CGTGAGCGAC AAAGCCGGGA GGCGGACCGC GCGATGGGTA 1260 

CATGTGTGTG CGGAGGGTCG GTGGGTCGGT CGGTTGCGCG GCATAGCGTG TTGTGTGTGT 1320 

GCGGCTGCAG GGGTATGTGG GCACCCGGGC ACGGAGGAGA AGGCACACGC AGGTGGCGCG 1380 

GAGGTGTGTC AGGGGCCATG GGCGGGCCTC ACTCCTGGTC GTGCCCAGTG GTCTCGTGGG 1440

CAGAGTGGCA GGGGCTGCAC CCATATGAGC GGCGCACTGC CGCGCTGGGC TAAGTCCTTA 1500 

TCACTTGGTG AGGTGGGGCG AGGTGGCTGT GGGCGGCGGG CGCAGTGGCA GAAGGACACG 1560 

GTGTGTGAGC GGTGGAGCTC TGGCCGTGCC GGCCGTGAGG GGCGGATAGC GATATGACGT 1620

TGTGCTTGGC CGCTGTAATG CGGGAGAATG TGCAGGCCGC GAGAAGCGGG CGGTGGCAGG 1680

AGGCCGCAGG CTGCAGCACC CGTTGGGGAG GTGCCACCTG CAGGCGCGGC GCCGGGCGGG 1740

CCTGAGTAAT GGGCGCCTGA GTAGTGGCGG CCACAGGAGG CGCAGGAGGC AGCAGCAGGA 1800

GGACGAGCTG GAGGGACCCG TTGGCAACCC AAGGTTGCGC GTGTAACATA GTGGCCATAC 1860

AAAAAAAAAA AAAA 1874

<210> 34 
<211> 954 
<212> DNA 

<213> Tagetes erecta 
<400> 34 

CCAAAAACAA CTCAAATCTC CTCCGTCGCT CTTACTCCGC CATGGGTGAC GACTCCGGCA 60 

TGGATGCTGT TCAGCGACGT CTCATGTTTG ACGATGAATG CATTTTGGTG GATGAGTGTG 120

ACAATGTGGT GGGACATGAT ACCAAATACA ATTGTCACTT GATGGAGAAG ATTGAAACAG 180 

GTAAAATGCT GCACAGAGCA TTCAGCGTTT TTCTATTCAA TTCAAAATAC GAGTTACTTC 240

TTCAGCAACG GTCTGCAACC AAGGTGACAT TTCCTTTAGT ATGGACCAAC ACCTGTTGCA 300 

GCCATCCACT CTACAGAGAA TCCGAGCTTG TTCCCGAAAA CGCCCTTGGA GTAAGAAATG 360

CTGCACAGAG GAAGCTGTTG GATGAACTCG GTATCCCTGC TGAAGATGTT CCCGTTGATC 420

AGTTTACTCC TTTAGGTCGC ATGCTCTACA AGGCTCCATC TGATGGAAAG TGGGGAGAAC 480

ATGAACTTGA CTACCTACTT TTCATAGTGA GAGACGTTGC TGTAAACCCG AACCCAGATG 540

AAGTGGCGGA TATCAAATAT GTGACCAGAA GAGTTAAAGG AGCTGCTAAG GAAAGCAGAT 600 

GCGGGGGAGG AGGGTTTGAA GCTGTCTCCA TGGTTCAGGT TAGTGGTTGA TAACTTCTTG 660 

TTCAAGTGGT GGGATCATGT GCAAAAGGGT ACACTCACTG AAGCAATTGA TATGAAAACC 720 

ATACACAAGC TGATATAGAA ACACACCCTC AACCGAAAAG TTCAAGCCTA ATAATTCGGG 780

TTGGGTCGGG TCTACCATCA ATTGTTTTTT TCTTTTAAGA AGTTTTAATC TCTATTTGAG 840

CATGTTGATT CTTGTCTTTT GTGTGTAAGA TTTTGGGTTT CGTTTCAGTT GTAATAATGA 900

ACCATTGATG GTTTGCAATT TCAAGTTCCT ATCGACATGT AGTGATCTAA AAAA 954

<210> 35 
<211> 1031 
<212> DNA 

<213> Oryza sativa 
<400> 35 

CCTCCCTTTG CCTCGCGCAG AGGCGGCCGC GCCTTCTCCG CCGCGAGGAT GGCCGGCGCC 60 

GCCGCCGCCG TGGAGGACGC CGGGATGGAC GAGGTCCAGA AGCGGCTCAT GTTCGACGAC 120 

GAATGCATTT TGGTGGATGA ACAAGACAAT GTTGTTGGCC ATGAATCAAA ATATAACTGC 180 

CATCTGATGG AAAAAATCGA ATCTGAAAAT CTACTTCATA GGGCTTTCAG TGTATTCCTG 240

TTCAACTCAA AATATGAACT CCTACTCCAG CAACGATCTG CAACAAAGGT TACATTTCCT 300

CTAGTTTGGA CCAACACTTG CTGCAGCCAT CCTCTGTACC GTGAGTCTGA GCTTATACAG 360

GAAAACTACC TTGGTGTTAG AAATGCTGCT CAGAGGAAGC TCTTGGATGA GCTGGGCATC 420

CCAGCTGAAG ATGTGCCAGT TGACCAATTC ACCCCTCTTG GTCGGATGCT TTACAAGGCC 480

CCATCTGATG GAAAATGGGG TGAACACGAG CTTGACTACC TGCTGTTCAT CGTCCGCGAC 540

GTGAAGGTAG TCCCGAACCC GGACGAAGTG GCCGATGTGA AATACGTGAG CCGTGAGCAG 600 

CTGAAGGAGC TCATCCGCAA AGCGGACGCC GGAGAGGAAG GCCTGAAGCT GTCTCCCTGG 660 

TTCCGGCTGG TTGTTGACAA CTTCCTCATG GGCTGGTGGG ATCACGTCGA GAAAGGCACC 720 

CTCAACGAGG CCGTGGACAT GGAGACCATC CACAAGCTGA AGTAAGGACT GCGATGTTGT 780

GGCTGGAAAG AATGATCCTG AAGACTCTGT TCTTGTGCTG CTGCATATTA CTCTTACCAG 840

GGAAGTTGCA GAAGTCAGAA GAAGCTTTTG TATGTTTCTG GGTTTGGAGC TTGGAAGTGT 900 

TGGGCTCTGC TGACTGAGAG ATTCCCTTAT AGAGTGTCTA TGTTAATTTA GCAAACTTCT 960 

ATATTATACA TGATTAGTTA ATTGTTCGGT GTCTGAATAA AGAACAATAG CATGTTCCAT 1020 

GTTTATTTGC T 1031 

<210> 36 
<211> 232 
<212> PRT 

<213> Tagetes erecta 
<400> 36 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe
1 5 10 15

Asp Asp Glu Cys Ile Leu Val Asp Glu Cys Asp Asn Val Val Gly His
20 25 30

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Thr Gly Lys
35 40 45

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
50 55 60

Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val
65 70 75 80

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
85 90 95

Val Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
100 105 110

Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Gln Phe
115 120 125

Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp
130 135 140

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Ala
145 150 155 160

Val Asn Pro Asn Pro Asp Glu Val Ala Asp Ile Lys Tyr Val Ser His
165 170 175

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly
180 185 190

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe
195 200 205

Lys Trp Trp Asp His Val Gln Lys Gly Thr Leu Thr Glu Ala Ile Asp
210 215 220

Met Lys Thr Ile His Lys Leu Ile
225 230

<210> 37 
<211> 280 
<212> PRT 

<213> Lactuca sativa
<400> 37

Met Leu Lys Phe Pro Pro Phe Lys Thr Ile Ala Thr Met Ile Ser Ser
1 5 10 15

Pro Tyr Ser Ser Phe Leu Leu Pro Arg Lys Ser Ser Phe Pro Pro Met
20 25 30

Pro Ser Leu Ala Ala Ala Ser Val Phe Leu His Pro Leu Ser Ser Ala
35 40 45

Ala Met Gly Asp Ser Ser Met Asp Ala Val Gln Arg Arg Leu Met Phe
50 55 60

Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val Gly His
65 70 75 80

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Lys Gly Asn
85 90 95

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
100 105 110

Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val
115 120 125

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
130 135 140

Ile Asp Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
145 150 155 160

Leu Asp Glu Leu Gly Ile Pro Gly Ala Asp Val Pro Val Asp Glu Phe
165 170 175

Thr Pro Leu Gly Arg Ile Leu Tyr Lys Ala Ala Ser Asp Gly Lys Trp
180 185 190

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Met Val Arg Asp Val Gly
195 200 205

Leu Asp Pro Asn Pro Asp Glu Val Lys Asp Val Lys Tyr Val Asn Arg
210 215 220

Glu Glu Leu Lys Glu Leu Val Arg Lys Ala Asp Ala Gly Glu Glu Gly
225 230 235 240

Val Lys Leu Ser Pro Trp Phe Lys Leu Ile Val Asp Asn Phe Leu Phe
245 250 255

Gln Trp Trp Asp Arg Leu His Lys Gly Thr Leu Thr Glu Ala Ile Asp
260 265 270

Met Lys Thr Ile His Lys Leu Thr
275 280

<210> 38 
<211> 229 
<212> PRT 

<213> Lactuca sativa
<400> 38

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gln Arg Arg Leu Met Phe
1 5 10 15

Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Asn Val Leu Gly His
20 25 30

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Lys Asp Asn
35 40 45

Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu
50 55 60

Leu Leu Leu Gln Gln Arg Ser Glu Thr Lys Val Thr Phe Pro Leu Val
65 70 75 80

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu
85 90 95

Ile Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu
100 105 110

Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Glu Phe
115 120 125

Thr Thr Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp
130 135 140

Gly Glu His Glu Val Asp Tyr Leu Leu Phe Leu Val Arg Asp Val Ala
145 150 155 160

Val Asn Pro Asn Pro Asp Glu Val Ala Asp Ile Arg Tyr Val Asn Gln
165 170 175

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly
180 185 190

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe
195 200 205

Lys Trp Trp Asp His Val Gln Lys Gly Thr Leu Asn Glu Ala Ile Asp
210 215 220

Met Lys Thr Ile His
225

<210> 39 
<211> 295 
<212> PRT 

<213> Adonis palaestina
<400> 39

Met Ser Ser Ile Arg Ile Asn Pro Leu Tyr Ser Ile Phe Ser Thr Thr
1 5 10 15

Thr Lys Thr Leu Ser Ala Ser Cys Ser Ser Pro Ala Val His Leu Gln
20 25 30

Gln Arg Cys Arg Thr Leu Ser Ile Ser Ser Ser Ile Thr Asn Ser Pro
35 40 45

Arg Arg Gly Leu Asn Arg Leu Phe Ala Ser Thr Ser Thr Met Gly Glu
50 55 60

Val Ala Asp Ala Gly Met Asp Ala Val Gln Lys Arg Leu Met Phe Asp
65 70 75 80

Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val Gly Tyr Asp
85 90 95

Ser Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala Glu Asn Leu
100 105 110

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu
115 120 125

Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val Trp
130 135 140

Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser Glu Leu Ile
145 150 155 160

Glu Glu Asn Phe Leu Gly Val Arg Asn Ala Ala Gln Arg Lys Leu Leu
165 170 175

Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp Glu Phe Thr
180 185 190

Pro Leu Gly Arg Ile Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly
195 200 205

Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp Val Lys Tyr
210 215 220

Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val Asn Arg Glu
225 230 235 240

Glu Leu Lys Glu Ile Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly Ile
245 250 255

Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe Lys
260 265 270

Trp Trp Asp His Val Glu Glu Gly Lys Ile Lys Asp Val Ala Asp Met
275 280 285

Lys Thr Ile His Lys Leu Thr
290 295

<210> 40 
<211> 234 
<212> PRT 

<213> Adonis palaestina
<400> 40

Met Gly Glu Val Thr Asp Ala Gly Met Asp Ala Val Gln Lys Arg Leu
1 5 10 15

Met Phe Asp Asp Glu Cys Ile Leu Val Asp Glu Asn Asp Lys Val Val
20 25 30

Gly His Asp Ser Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala
35 40 45

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys
50 55 60

Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys Val Thr Phe Pro
65 70 75 80

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser
85 90 95

Glu Leu Ile Glu Glu Asn Tyr Leu Gly Val Arg Asn Ala Ala Gln Arg
100 105 110

Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp Val Pro Val Asp
115 120 125

Glu Phe Thr Pro Leu Gly Arg Ile Leu Tyr Lys Ala Pro Ser Asp Gly
130 135 140

Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Ile Val Arg Asp
145 150 155 160

Val Lys Tyr Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val
165 170 175

Asn Arg Glu Glu Leu Arg Glu Ile Leu Arg Lys Ala Asp Ala Gly Glu
180 185 190

Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe
195 200 205

Leu Phe Lys Trp Trp Asp His Val Glu Gln Gly Thr Ile Lys Glu Val
210 215 220

Ala Asp Met Lys Thr Ile His Lys Leu Thr
225 230

<210> 41 
<211> 238
<212> PRT

<213> Oryza sativa
<400> 41

Met Ala Gly Ala Ala Ala Ala Val Glu Asp Ala Gly Met Asp Glu Val
1 5 10 15

Gln Lys Arg Leu Met Phe Asp Asp Glu Cys Ile Leu Val Asp Glu Gln
20 25 30

Asp Asn Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu Met Glu
35 40 45

Lys Ile Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu
50 55 60

Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gln Gln Arg Ser Ala Thr Lys
65 70 75 80

Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu
85 90 95

Tyr Arg Glu Ser Glu Leu Ile Gln Glu Asn Tyr Leu Gly Val Arg Asn
100 105 110

Ala Ala Gln Arg Lys Leu Leu Asp Glu Leu Gly Ile Pro Ala Glu Asp
115 120 125

Val Pro Val Asp Gln Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala
130 135 140

Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe
145 150 155 160

Ile Val Arg Asp Val Lys Val Val Pro Asn Pro Asp Glu Val Ala Asp
165 170 175

Val Lys Tyr Val Ser Arg Glu Gln Leu Lys Glu Leu Ile Arg Lys Ala
180 185 190

Asp Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val
195 200 205

Val Asp Asn Phe Leu Met Gly Trp Trp Asp His Val Glu Lys Gly Thr
210 215 220

Leu Asn Glu Ala Val Asp Met Glu Thr Ile His Lys Leu Lys
225 230 235

<210> 42 
<211> 233 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Met Thr Asp Ser Asn Asp Ala Gly Met Asp Ala Val Gln Arg Arg Leu
1 5 10 15

Met Phe Glu Asp Glu Cys Ile Leu Val Asp Glu Asn Asn Arg Val Val
20 25 30

Gly His Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys Ile Glu Ala
35 40 45

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys
50 55 60

Tyr Glu Leu Leu Leu Gln Gln Arg Ser Lys Thr Lys Val Thr Phe Pro
65 70 75 80

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser
85 90 95

Glu Leu Ile Glu Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gln Arg
100 105 110

Lys Leu Phe Asp Glu Leu Gly Ile Val Ala Glu Asp Val Pro Val Asp
115 120 125

Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly
130 135 140

Lys Trp Gly Glu His Glu Val Asp Tyr Leu Leu Phe Ile Val Arg Asp
145 150 155 160

Val Lys Leu Gln Pro Asn Pro Asp Glu Val Ala Glu Ile Lys Tyr Val
165 170 175

Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp
180 185 190

Glu Ala Val Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe
195 200 205

Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Ile Thr Glu Ala
210 215 220

Ala Asp Met Lys Thr Ile His Lys Leu
225 230

<210> 43 
<211> 293 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 43 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly
35 40 45

Ala Ser Thr Trp Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys
50 55 60

Asp Glu Cys Ile Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala
65 70 75 80

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu
85 90 95

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu
100 105 110

Leu Leu Gln Gln Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp
115 120 125

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu
130 135 140

Val Asp Gln Leu Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys
145 150 155 160

Ala Ala Ala Ile Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His
165 170 175

Gln Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys
180 185 190

Ala Ala Asp Val Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu
195 200 205

His Glu Met Asp Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala
210 215 220

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu
225 230 235 240

Leu Arg Gln Met Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp
245 250 255

Phe Arg Ile Ile Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu
260 265 270

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His
275 280 285

His Ile Asn Glu Ala
290

<210> 44 
<211> 304 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 44 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn
1 5 10 15

Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu
20 25 30

Arg Ser Met Gln Met Thr Leu Met Gln Pro Ser Ile Ser Ala Asn Leu
35 40 45

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp
50 55 60

Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys Asp Glu Cys Ile
65 70 75 80

Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala Ser Lys Leu Glu
85 90 95

Cys His Lys Phe Leu Pro His Pro Ala Gly Leu Leu His Arg Ala Phe
100 105 110

Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu Leu Leu Gln Gln Arg
115 120 125

Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp Thr Asn Thr Cys Cys
130 135 140

Ser His Pro Leu His Gly Gln Thr Pro Asp Glu Val Asp Gln Leu Ser
145 150 155 160

Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala Ile Arg
165 170 175

Lys Leu Glu His Glu Leu Gly Ile Pro Ala His Gln Leu Pro Ala Ser
180 185 190

Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val Gln
195 200 205

Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu His Glu Met Asp Tyr
210 215 220

Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp Glu
225 230 235 240

Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu Leu Arg Gln Met Met
245 250 255

Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp Phe Arg Ile Ile Ala
260 265 270

Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu Asn
275 280 285

Thr Asp Lys His Glu Asp Trp Gly Thr Val His His Ile Asn Glu Ala
290 295 300

<210> 45 
<211> 307 
<212> PRT 

<213> Chlamydomonas reinhardtii 
<400> 45 

Met Arg Ser Ser Phe Ile Glu Pro Lys Pro Arg Ala Gln Pro Val Leu
1 5 10 15

Ser Arg Gly Arg Ala Ser Met Arg Leu Ala Gln Ser Arg Ala Leu Val
20 25 30

Ala Arg Val Ser Ser Ala Leu Trp Pro Gly Ala Gly Leu Ser Gln Ala
35 40 45

Gln Ser Val Ala Val Arg Met Ala Ser Ser Ser Thr Trp Glu Gly Thr
50 55 60

Gly Leu Ser Gln Asp Asp Phe Met Gln Arg Asp Glu Cys Leu Val Val
65 70 75 80

Asp Glu Gln Asp Arg Leu Leu Gly Thr Ala Asn Lys Tyr Asp Cys His
85 90 95

Arg Phe Glu Ala Ala Lys Gly Gln Pro Cys Gly Arg Leu His Arg Ala
100 105 110

Phe Ser Val Phe Leu Phe Ser Pro Asp Gly Arg Leu Leu Leu Gln Gln
115 120 125

Arg Ala Ala Ser Lys Val Thr Phe Pro Gly Val Trp Thr Asn Thr Cys
130 135 140

Cys Ser His Pro Leu Ala Gly Gln Ala Pro Asp Glu Val Asp Leu Pro
145 150 155 160

Ala Ala Val Ala Ser Gly Gln Val Pro Gly Ile Lys Ala Ala Ala Val
165 170 175

Arg Lys Leu Gln His Glu Leu Gly Ile Pro Pro Glu Gln Val Pro Ala
180 185 190

Ser Ser Phe Ser Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Thr
195 200 205

Ala Thr His Gly Pro Ala Ala Glu Trp Gly Glu His Glu Val Asp Tyr
210 215 220

Val Leu Phe Val Arg Pro Gln Gln Pro Val Ser Leu Gln Pro Asn Pro
225 230 235 240

Asp Glu Val Asp Ala Thr Arg Tyr Val Thr Leu Pro Glu Leu Gln Ser
245 250 255

Met Met Ala Asp Pro Gly Leu Ser Trp Ser Pro Trp Phe Arg Ile Leu
260 265 270

Ala Thr Gln Pro Ala Phe Leu Pro Ala Trp Trp Gly Asp Leu Lys Arg
275 280 285

Arg Trp Arg Pro Gly Gly Ser Arg Leu Ser Asp Trp Gly Thr Ile His
290 295 300

Arg Val Met
305

<210> 46 
<211> 1848 
<212> DNA 

<213> Adonis palaestina
<400> 46

GAGAGAAAAA GAGTGTTATA TTAATGTTAC TGTCGCATTC TTGCAACACA TATTCAGACT 60 
CCATTTTCTT GTTTTCTCTT CAAAACAACA AACTAATGTG ACGGAGTATC TAGCTATGGA 120 
ACTACTTGGT GTTCGCAACC TCATCTCTTC TTGCCCTGTC TGGACTTTTG GAACAAGAAA 180 
CCTTAGTAGT TCAAAACTAG CTTATAACAT ACATCGATAT GGTTCTTCTT GTAGAGTAGA 240
TTTTCAAGTG AGGGCTGATG GTGGAAGCGG GAGTAGAACT TCTGTTGCTT ATAAAGAGGG 300

TTTTGTGGAC GAGGAGGATT TTATCAAAGC TGGTGGTTCT GAGCTTTTGT TTGTCCAAAT 360

GCAGCAAACA AAGTCTATGG AGAAACAGGC CAAGCTCGCC GATAAGTTGC CACCAATACC 420

TTTCGGAGAA TCTGTGATGG ACTTGGTTGT AATAGGTTGT GGACCTGCTG GTCTTTCACT 480

GGCTGCAGAA GCTGCTAAGC TAGGCTTGAA AGTTGGCCTT ATTGGTCCTG ATCTTCCTTT 540

TACAAATAAT TATGGTGTGT GGGAAGACGA GTTCAAAGAT CTTGGACTTG AACGTTGTAT 600 

CGAGCATGCT TGGAAGGACA CCATCGTATA TCTTGACAAT GATGCTCCTG TCCTTATTGG 660 

TCGTGCATAT GGACGAGTTA GCCGGCATTT GCTGCATGAA GAGTTGCTGA AAAGGTGTGT 720 

CGAGTCAGGT GTATCATATC TGAATTCTAA AGTGGAAAGG ATCACTGAAG CTGGTGATGG 780 

CCATAGTCTT GTAGTTTGTG AAAACGACAT CTTTATCCCT TGCAGGCTTG CTACTGTTGC 840
ATCTGGAGCA GCTTCAGGGA AACTTTTGGA GTATGAAGTA GGTGGCCCTC GTGTTTGTGT 900 
CCAAACTGCT TATGGTGTGG AGGTTGAGGT GGAGAACAAT CCATACGATC CCAACTTAAT 960 

GGTATTTATG GACTACAGAG ACTATATGCA ACAGAAATTA CAGTGCTCGG AAGAAGAATA 1020

TCCAACATTT CTCTATGTCA TGCCCATGTC GCCAACAAGA CTTTTTTTTG AGGAAACCTG 1080 

TTTGGCCTCA AAAGATGCCA TGCCTTTCGA TCTACTGAAG AGAAAACTAA TGTCACGATT 1140 

GAAGACTCTG GGTATCCAAG TTACAAAAAT TTATGAAGAG GAATGGTCTT ATATTCCTGT 1200

TGGGGGTTCT TTACCAAACA CAGAGCAAAA GAACCTAGCA TTTGGTGCTG CAGCAAGCAT 1260

GGTGCATCCA GCAACAGGCT ATTCGGTTGT ACGATCACTA TCAGAAGCTC CAAAATATGC 1320 

TTCTGTAATT GCAAAGATTT TGAAGCAAGA TAACTCTGCA TATGTGGTTT CTGGACAAAG 1380 

CAGTGCAGTA AACATTTCAA TGCAAGCATG GAGCAGTCTT TGGCCAAAGG AGCGAAAACG 1440

TCAAAGAGCA TTCTTTCTTT TCGGGTTAGA GCTTATTGTG CAGCTAGATA TTGAAGCAAC 1500 

CAGAACGTTC TTTAGAACCT TCTTCCGCTT GCCAACTTGG ATGTGGTGGG GTTTCCTTGG 1560 

GTCTTCACTA TCATCTTTCG ATCTTGTATT GTTTTCCATG TACATGTTTG TTTTGGCCCC 1620 

GAACAGCATG AGGATGTCAC TTGTGAGACA TTTGCTTTCA GATCCTTCTG GTGCAGTTAT 1680 

GGTTAAAGCT TACCTCGAAA GGTAATCTGT TTTATGAAAC TATAGTGTCT CATTAAATAA 1740 

ATGAGGATCC TTCGTATATG TATATGATCA TCTCTATGTA TATCCTATAT TCTAATCTCA 1800 

TAAAGTAATC GAAAATTCAT TGATAGAAAA AAAAAAAAAA AAAAAAAA 1848

<210> 47 
<211> 529 
<212> PRT 
<213> Adonis palaestina

<400> 47

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg
145 150 155 160

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His
500 505 510

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu
515 520 525

Arg

<210> 48 
<211> 378 
<212> PRT 
<213> Potato 

<400> 48 

Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile Glu His Val Trp
1 5 10 15

Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Asp Pro Ile Leu Ile Gly
20 25 30

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu
35 40 45

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp
50 55 60

Arg Ile Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly
65 70 75 80

Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala
85 90 95

Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Gly Pro Arg Val Ser Val
100 105 110

Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp
115 120 125

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp
130 135 140

Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro
145 150 155 160

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys
165 170 175

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu
180 185 190

Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu Glu Glu Trp Ser
195 200 205

Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gln Lys Thr Leu
210 215 220

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser
225 230 235 240

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala
245 250 255

Asn Ile Leu Arg Gln Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser
260 265 270

Thr Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu Trp Pro Gln Glu
275 280 285

Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu Ile Leu
290 295 300

Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg Ala Phe Phe Arg
305 310 315 320

Val Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Ser Leu Ser Xaa
325 330 335

Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile Ile Ala Pro Asn
340 345 350

Asp Met Arg Arg Gly Leu Ile Arg His Leu Leu Ser Asp Pro Thr Gly
355 360 365

Ala Thr Leu Ile Arg Thr Tyr Leu Thr Phe
370 375



<210> 49 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 49

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr
1 5 10 15

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr
20 25 30

Ser Tyr Arg Asn Ile Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly
35 40 45

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe
50 55 60

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu Ile Leu Phe
65 70 75 80

Val Gln Met Gln Gln Asn Lys Asp Met Asp Glu Gln Ser Lys Leu Val
85 90 95

Asp Lys Leu Pro Pro Ile Ser Ile Gly Asp Gly Ala Leu Asp His Val
100 105 110

Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala
115 120 125

Lys Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr
130 135 140

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gln
145 150 155 160

Lys Cys Ile Glu His Val Trp Arg Glu Thr Ile Val Tyr Leu Asp Asp
165 170 175

Asp Lys Pro Ile Thr Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg
180 185 190

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser
195 200 205

Tyr Leu Ser Ser Lys Val Asp Ser Ile Thr Glu Ala Ser Asp Gly Leu
210 215 220

Arg Leu Val Ala Cys Asp Asp Asn Asn Val Ile Pro Cys Arg Leu Ala
225 230 235 240

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr Glu Val
245 250 255

Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu
260 265 270

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gln Met Val Phe Met Asp Tyr
275 280 285

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro
290 295 300

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu
305 310 315 320

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys
325 330 335

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly Ile Arg Ile Leu Lys
340 345 350

Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro
355 360 365

Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val
370 375 380

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro
385 390 395 400

Lys Tyr Ala Ser Val Ile Ala Glu Ile Leu Arg Glu Glu Thr Thr Lys
405 410 415

Gln Ile Asn Ser Asn Ile Ser Arg Gln Ala Trp Asp Thr Leu Trp Pro
420 425 430

Pro Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu
435 440 445

Ile Val Gln Phe Asp Thr Glu Gly Ile Arg Ser Phe Phe Arg Thr Phe
450 455 460

Phe Arg Leu Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser Thr Leu
465 470 475 480

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val Ile Ser
485 490 495

Pro Asn Asn Leu Arg Lys Gly Leu Ile Asn His Leu Ile Ser Asp Pro
500 505 510

Thr Gly Ala Thr Met Ile Lys Thr Tyr Leu Lys Val
515 520



<210> 50 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 50 

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 160 

Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asp Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Glu Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 255 

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Val
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu 
515 520 525 

Arg 



<210> 51 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 51 

Met Glu Leu Leu Gly Val Arg Asn Leu Ile Ser Ser Cys Pro Val Trp
1 5 10 15

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn Ile
20 25 30

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gln Val Arg Ala Asp
35 40 45

Gly Gly Ser Gly Ser Arg Thr Ser Val Ala Tyr Lys Glu Gly Phe Val
50 55 60

Asp Glu Glu Asp Phe Ile Lys Ala Gly Gly Ser Glu Leu Leu Phe Val
65 70 75 80

Gln Met Gln Gln Thr Lys Ser Met Glu Lys Gln Ala Lys Leu Ala Asp
85 90 95

Lys Leu Pro Pro Ile Pro Phe Gly Glu Ser Val Met Asp Leu Val Val
100 105 110

Ile Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys
115 120 125

Leu Gly Leu Lys Val Gly Leu Ile Gly Pro Asp Leu Pro Phe Thr Asn
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 160 



Cys Ile Glu His Ala Trp Lys Asp Thr Ile Val Tyr Leu Asp Asn Asp
165 170 175

Ala Pro Val Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu
180 185 190

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr
195 200 205

Leu Asn Ser Lys Val Glu Arg Ile Thr Glu Ala Gly Asp Gly His Ser
210 215 220

Leu Val Val Cys Glu Asn Asp Ile Phe Ile Pro Cys Arg Leu Ala Thr
225 230 235 240

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly
245 250 255

Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val
260 265 270

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg
275 280 285

Asp Tyr Met Gln Gln Lys Leu Gln Cys Ser Glu Glu Glu Tyr Pro Thr
290 295 300

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu
305 310 315 320

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg
325 330 335

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly Ile Gln Val Thr Lys Ile
340 345 350

Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn
355 360 365

Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His
370 375 380

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys
385 390 395 400

Tyr Ala Ser Val Ile Ala Lys Ile Leu Lys Gln Asp Asn Ser Ala Tyr
405 410 415

Val Val Ser Gly Gln Ser Ser Ala Val Asn Ile Ser Met Gln Ala Trp
420 425 430

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gln Arg Ala Phe Phe Leu
435 440 445

Phe Gly Leu Glu Leu Ile Val Gln Leu Asp Ile Glu Ala Thr Arg Thr
450 455 460

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe
465 470 475 480

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr
485 490 495



Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Lys Ala Tyr Leu Glu 
515 520 525 

Arg 



<210> 52 
<211> 533 
<212> PRT 
<213> Lettuce 

<400> 52 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe 
1 5 10 15

Thr Cys Pro Arg Phe Thr Asp Cys Asn Ile Arg His Lys Phe Ser Leu
20 25 30

Leu Lys Gln Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg
35 40 45

Gln Ile Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gln
50 55 60

Gly Ile Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser
65 70 75 80

Glu Leu Phe Phe Val Gln Met Gln Arg Thr Lys Ser Met Glu Ser Gln
85 90 95

Ser Lys Leu Ser Glu Lys Leu Ala Gln Ile Pro Ile Gly Asn Cys Ile
100 105 110

Leu Asp Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala
115 120 125

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu Ile Gly Pro Asp
130 135 140

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gln Asp Glu Phe Ile Gly
145 150 155 160

Leu Gly Leu Glu Gly Cys Ile Glu His Ser Trp Lys Asp Thr Leu Val
165 170 175

Tyr Leu Asp Asp Ala Asp Pro Ile Arg Ile Gly Arg Ala Tyr Gly Arg
180 185 190

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu
195 200 205

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala
210 215 220

Pro Asn Gly Tyr Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro
225 230 235 240



Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu 
245 250 255 

Glu Tyr Glu Leu Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly
260 265 270

Ile Glu Val Glu Val Glu Asn Asn Pro Tyr Asp Pro Asp Leu Met Val
275 280 285

Phe Met Asp Tyr Arg Asp Phe Ser Lys His Lys Pro Glu Ser Leu Glu
290 295 300

Ala Lys Tyr Pro Thr Phe Leu Tyr Val Met Ala Met Ser Pro Thr Lys
305 310 315 320

Ile Phe Phe Glu Glu Thr Cys Leu Ala Ser Arg Glu Ala Met Pro Phe
325 330 335

Asn Leu Leu Lys Ser Lys Leu Met Ser Arg Leu Lys Ala Met Gly Ile
340 345 350

Arg Ile Thr Arg Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly
355 360 365

Gly Ser Leu Pro Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala
370 375 380

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu
385 390 395 400

Ser Glu Ala Pro Asn Tyr Ala Ala Val Ile Ala Lys Ile Leu Arg Gln
405 410 415

Asp Gln Ser Lys Glu Met Ile Ser Leu Gly Lys Tyr Thr Asn Ile Ser
420 425 430

Lys Gln Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gln Arg
435 440 445

Ala Phe Phe Leu Phe Gly Leu Ser His Ile Val Leu Met Asp Leu Glu
450 455 460

Gly Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp Met
465 470 475 480

Trp Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu Ile Ile
485 490 495

Phe Ala Leu Tyr Met Phe Val Ile Ala Pro His Ser Leu Arg Met Glu
500 505 510

Leu Val Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Met Val Lys
515 520 525

Ala Tyr Leu Thr Ile
530



<210> 53
<211> 526
<212> PRT
<213> Tomato

<400> 53

Met Glu Cys Val Gly Val Gln Asn Val Gly Ala Met Ala Val Leu Thr
1 5 10 15

Arg Pro Arg Leu Asn Arg Trp Ser Gly Gly Glu Leu Cys Gln Glu Lys
20 25 30

Ser Ile Phe Leu Ala Tyr Glu Gln Tyr Glu Ser Lys Cys Asn Ser Ser
35 40 45

Ser Gly Ser Asp Ser Cys Val Val Asp Lys Glu Asp Phe Ala Asp Glu
50 55 60

Glu Asp Tyr Ile Lys Ala Gly Gly Ser Gln Leu Val Phe Val Gln Met
65 70 75 80

Gln Gln Lys Lys Asp Met Asp Gln Gln Ser Lys Leu Ser Asp Glu Leu
85 90 95

Arg Gln Ile Ser Ala Gly Gln Thr Val Leu Asp Leu Val Val Ile Gly
100 105 110

Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly
115 120 125

Leu Asn Val Gly Leu Val Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr
130 135 140

Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Gln Ala Cys Ile
145 150 155 160

Glu His Val Trp Arg Asp Thr Ile Val Tyr Leu Asp Asp Asp Glu Pro
165 170 175

Ile Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser Arg His Phe Leu His
180 185 190

Glu Glu Leu Leu Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn
195 200 205

Ser Lys Val Asp Arg Ile Val Glu Ala Thr Asn Gly Gln Ser Leu Val
210 215 220

Glu Cys Glu Gly Asp Val Val Ile Pro Cys Arg Phe Val Thr Val Ala
225 230 235 240

Ser Gly Ala Ala Ser Gly Lys Phe Leu Gln Tyr Glu Leu Gly Ser Pro
245 250 255

Arg Val Ser Val Gln Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn
260 265 270

Asn Pro Phe Asp Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr
275 280 285

Leu Arg His Asp Ala Gln Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu
290 295 300 






Tyr Ala Met Pro Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys
305 310 315 320

Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu
325 330 335

Met Leu Arg Leu Asn Thr Leu Gly Val Arg Ile Lys Glu Ile Tyr Glu
340 345 350

Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser Leu Pro Asn Thr Glu
355 360 365

Gln Lys Thr Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala
370 375 380

Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala
385 390 395 400

Ser Val Leu Ala Asn Ile Leu Arg Gln His Tyr Ser Lys Asn Met Leu
405 410 415

Thr Ser Ser Ser Ile Pro Ser Ile Ser Thr Gln Ala Trp Asn Thr Leu
420 425 430

Trp Pro Gln Glu Arg Lys Arg Gln Arg Ser Phe Phe Leu Phe Gly Leu
435 440 445

Ala Leu Ile Leu Gln Leu Asp Ile Glu Gly Ile Arg Ser Phe Phe Arg
450 455 460

Ala Phe Phe Arg Val Pro Lys Trp Met Trp Gln Gly Phe Leu Gly Ser
465 470 475 480

Ser Leu Ser Ser Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe Ile
485 490 495

Ile Ala Pro Asn Asp Met Arg Lys Gly Leu Ile Arg His Leu Leu Ser
500 505 510

Asp Pro Thr Gly Ala Thr Leu Ile Arg Thr Tyr Leu Thr Phe
515 520 525



<210> 54 
<211> 516 
<212> PRT 

<213> Tagetes erecta 
<400> 54 

Met Ser Met Arg Ala Gly His Met Thr Ala Thr Met Ala Ala Phe Thr 
1 5 10 15

Cys Pro Arg Phe Met Thr Ser Ile Arg Tyr Thr Lys Gln Ile Lys Cys
20 25 30

Asn Ala Ala Lys Ser Gln Leu Val Val Lys Gln Glu Ile Glu Glu Glu
35 40 45

Glu Asp Tyr Val Lys Ala Gly Gly Ser Glu Leu Leu Phe Val Gln Met
50 55 60

Gln Gln Asn Lys Ser Met Asp Ala Gln Ser Ser Leu Ser Gln Lys Leu
65 70 75 80

Pro Arg Val Pro Ile Gly Gly Gly Gly Asp Ser Asn Cys Ile Leu Asp
85 90 95

Leu Val Val Ile Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Gly Glu
100 105 110

Ser Ala Lys Leu Gly Leu Asn Val Ala Leu Ile Gly Pro Asp Leu Pro
115 120 125

Phe Thr Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Ile Gly Leu Gly
130 135 140

Leu Glu Gly Cys Ile Glu His Val Trp Arg Asp Thr Val Val Tyr Leu
145 150 155 160

Asp Asp Asn Asp Pro Ile Leu Ile Gly Arg Ala Tyr Gly Arg Val Ser
165 170 175

Arg Asp Leu Leu His Glu Glu Leu Leu Thr Arg Cys Met Glu Ser Gly
180 185 190

Val Ser Tyr Leu Ser Ser Lys Val Glu Arg Ile Thr Glu Ala Pro Asn
195 200 205

Gly Leu Ser Leu Ile Glu Cys Glu Gly Asn Ile Thr Ile Pro Cys Arg
210 215 220

Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gln Tyr
225 230 235 240

Glu Leu Gly Gly Pro Arg Val Cys Val Gln Thr Ala Tyr Gly Ile Glu
245 250 255

Val Glu Val Glu Ser Ile Pro Tyr Asp Pro Ser Leu Met Val Phe Met
260 265 270

Asp Tyr Arg Asp Tyr Thr Lys His Lys Ser Gln Ser Leu Glu Ala Gln
275 280 285

Tyr Pro Thr Phe Leu Tyr Val Met Pro Met Ser Pro Thr Lys Val Phe
290 295 300

Phe Glu Glu Thr Cys Leu Ala Ser Lys Glu Ala Met Pro Phe Glu Leu
305 310 315 320

Leu Lys Thr Lys Leu Met Ser Arg Leu Lys Thr Met Gly Ile Arg Ile
325 330 335

Thr Lys Thr Tyr Glu Glu Glu Trp Ser Tyr Ile Pro Val Gly Gly Ser
340 345 350

Leu Pro Asn Thr Glu Gln Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser
355 360 365

Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu
370 375 380

Ala Pro Asn Tyr Ala Ala Val Ile Ala Lys Ile Leu Gly Lys Gly Asn
385 390 395 400

Ser Lys Gln Met Leu Asp His Gly Arg Tyr Thr Thr Asn Ile Ser Lys
405 410 415

Gln Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gln Arg Ala
420 425 430

Phe Phe Leu Phe Gly Leu Ala Leu Ile Val Gln Met Asp Ile Glu Gly
435 440 445

Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp
450 455 460

Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu Ile Ile Phe
465 470 475 480

Ala Phe Tyr Met Phe Ile Ile Ala Pro His Ser Leu Arg Met Gly Leu
485 490 495

Val Arg His Leu Leu Ser Asp Pro Thr Gly Gly Thr Met Leu Lys Ala
500 505 510

Tyr Leu Thr Ile
515



<210> 55 
<211> 501 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 55 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Asp Phe Phe Ile Pro
1 5 10 15

Gln Phe His Gly Phe Glu Arg Leu Cys Ser Asn Asn Pro Tyr His Ser
20 25 30

Arg Val Arg Leu Gly Val Lys Lys Arg Ala Ile Lys Ile Val Ser Ser
35 40 45

Val Val Ser Gly Ser Ala Ala Leu Leu Asp Leu Val Pro Glu Thr Lys
50 55 60

Lys Glu Asn Leu Asp Phe Glu Leu Pro Leu Tyr Asp Thr Ser Lys Ser
65 70 75 80

Gln Val Val Asp Leu Ala Ile Val Gly Gly Gly Pro Ala Gly Leu Ala
85 90 95

Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp
100 105 110

Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp
115 120 125 

Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr Thr Trp Ser 
130 135 140 

Gly Ala Val Val Tyr Val Asp Glu Gly Val Lys Lys Asp Leu Ser Arg
145 150 155 160

Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Leu Gln
165 170 175

Lys Cys Ile Thr Asn Gly Val Lys Phe His Gln Ser Lys Val Thr Asn
180 185 190

Val Val His Glu Glu Ala Asn Ser Thr Val Val Cys Ser Asp Gly Val
195 200 205

Lys Ile Gln Ala Ser Val Val Leu Asp Ala Thr Gly Phe Ser Arg Cys
210 215 220

Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala Tyr
225 230 235 240

Gly Ile Val Ala Glu Val Asp Gly His Pro Phe Asp Val Asp Lys Met
245 250 255

Val Phe Met Asp Trp Arg Asp Lys His Leu Asp Ser Tyr Pro Glu Leu
260 265 270

Lys Glu Arg Asn Ser Lys Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe
275 280 285

Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro
290 295 300

Gly Leu Arg Met Glu Asp Ile Gln Glu Arg Met Ala Ala Arg Leu Lys
305 310 315 320

His Leu Gly Ile Asn Val Lys Arg Ile Glu Glu Asp Glu Arg Cys Val
325 330 335

Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly
340 345 350

Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val
355 360 365

Ala Arg Thr Leu Ala Ala Ala Pro Ile Val Ala Asn Ala Ile Val Arg
370 375 380

Tyr Leu Gly Ser Pro Ser Ser Asn Ser Leu Arg Gly Asp Gln Leu Ser
385 390 395 400

Ala Glu Val Trp Arg Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg
405 410 415

Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Asp
420 425 430

Ala Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Gln Pro His Tyr
435 440 445 

Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu Val 
450 455 460 

Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu 
465 470 475 480

Ile Met Thr Lys Gly Thr Val Pro Leu Ala Lys Met Ile Asn Asn Leu
485 490 495

Val Gln Asp Arg Asp
500 



<210> 56 
<211> 502 
<212> PRT 

<213> Adonis palaestina 
<400> 56 

Met Asp Thr Leu Leu Arg Thr His Asn Lys Leu Glu Leu Leu Pro Thr 
1 5 10 15

Leu His Gly Phe Ala Glu Lys Gln His Leu Val Ser Thr Ser Lys Leu
20 25 30

Gln Asn Gln Val Phe Arg Ile Ala Ser Arg Asn Ile His Pro Cys Arg
35 40 45

Asn Gly Thr Val Lys Ala Arg Gly Ser Ala Leu Leu Glu Leu Val Pro
50 55 60

Glu Thr Lys Lys Glu Asn Leu Glu Phe Asp Leu Pro Ala Tyr Asp Pro
65 70 75 80

Ser Arg Gly Ile Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala
85 90 95

Gly Leu Ala Ile Ala Gln Gln Val Ser Glu Ala Gly Leu Leu Val Cys
100 105 110

Ser Ile Asp Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val
115 120 125 

Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr 
130 135 140 

Thr Trp Ser Gly Ala Val Val Tyr Thr Asp Asp Asn Ser Lys Lys Tyr 
145 150 155 160 

Leu Asp Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys
165 170 175

Met Leu Gln Lys Cys Val Thr Asn Gly Val Lys Phe His Gln Ala Lys
180 185 190

Val Ile Lys Val Ile His Glu Glu Ser Lys Ser Leu Leu Ile Cys Asn
195 200 205

Asp Gly Ile Thr Ile Asn Ala Thr Val Val Leu Asp Ala Thr Gly Phe
210 215 220

Ser Arg Cys Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln
225 230 235 240

Val Ala Tyr Gly Ile Met Ala Glu Val Glu Glu His Pro Phe Asp Leu
245 250 255

Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser His Leu Asn Glu Lys
260 265 270

Leu Glu Leu Lys Asp Lys Asn Arg Lys Ile Pro Thr Phe Leu Tyr Ala
275 280 285

Met Pro Phe Ser Ser Thr Lys Ile Phe Leu Glu Glu Thr Ser Leu Val
290 295 300

Ala Arg Pro Gly Leu Arg Phe Glu Asp Ile Gln Glu Arg Met Val Ala
305 310 315 320

Arg Leu Lys His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu
325 330 335

Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg
340 345 350

Val Val Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly
355 360 365

Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Lys Ser
370 375 380

Ile Val Gln Tyr Leu Gly Ser Asp Arg Ser Leu Ser Gly Asn Glu Leu
385 390 395 400

Ser Ala Glu Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln
405 410 415

Arg Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu
420 425 430

Gln Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro His
435 440 445

Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu
450 455 460

Phe Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Ala Ser Arg Ile
465 470 475 480

Glu Ile Met Ala Lys Gly Thr Val Pro Leu Val Asn Met Met Asn Asn
485 490 495

Leu Ile Gln Asp Thr Asp
500 



<210> 57 
<211> 498 
<212> PRT 
<213> Pepper 

<400> 57 

Met Asp Thr Leu Leu Arg Thr Pro Asn Asn Leu Glu Phe Leu His Gly 
1 5 10 15

Phe Gly Val Lys Val Ser Ala Phe Ser Ser Val Lys Ser Gln Lys Phe
20 25 30

Gly Ala Lys Lys Phe Cys Glu Gly Leu Gly Ser Arg Ser Val Cys Val
35 40 45 

Lys Ala Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys 
50 55 60 

Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val 
65 70 75 80 

Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val 
85 90 95 

Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile Asp Pro
100 105 110

Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu
115 120 125 

Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly 
130 135 140 

Ala Ala Val Tyr Ile Asp Asp Lys Thr Thr Lys Asp Leu Asn Arg Pro
145 150 155 160

Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met Gln Lys
165 170 175

Cys Ile Leu Asn Gly Val Lys Phe His Gln Ala Lys Val Ile Lys Val
180 185 190

Ile His Glu Glu Ser Lys Ser Met Leu Ile Cys Asn Asp Gly Ile Thr
195 200 205

Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg Ser Leu
210 215 220

Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala Tyr Gly
225 230 235 240

Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys Met Val
245 250 255 

Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Val Glu Leu Lys 
260 265 270 

Glu Arg Asn Ser Arg Ile Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser
275 280 285

Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly
290 295 300

Leu Gly Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu Ser His
305 310 315 320

Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys Val Ile
325 330 335

Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val Gly Ile
340 345 350 

Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala
355 360 365

Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile Gln Tyr
370 375 380

Leu Ser Ser Glu Arg Ser His Ser Gly Asp Glu Leu Ser Ala Ala Val
385 390 395 400

Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe
405 410 415

Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg
420 425 430

Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly
435 440 445

Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Ile Val Phe Gly Leu
450 455 460

Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu Ile Met Thr
465 470 475 480

Lys Gly Thr Leu Pro Leu Val His Met Ile Asn Asn Leu Leu Gln Asp
485 490 495

Lys Glu



<210> 58 
<211> 500 
<212> PRT 
<213> Tomato 



<400> 58 
Met Asp Thr Leu Leu Lys Thr Pro Asn Asn Leu Glu Phe Leu Asn Pro
1 5 10 15

His His Gly Phe Ala Val Lys Ala Ser Thr Phe Arg Ser Glu Lys His
20 25 30

His Asn Phe Gly Ser Arg Lys Phe Cys Glu Thr Leu Gly Arg Ser Val
35 40 45

Cys Val Lys Gly Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr
50 55 60

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys
65 70 75 80

Gly Val Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu
85 90 95

Ala Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Cys Ser Ile
100 105 110

Asp Pro Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val
115 120 125

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp
130 135 140

Ser Gly Ala Ala Val Tyr Ile Asp Asp Asn Thr Ala Lys Asp Leu His
145 150 155 160

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met
165 170 175

Gln Lys Cys Ile Met Asn Gly Val Lys Phe His Gln Ala Lys Val Ile
180 185 190

Lys Val Ile His Glu Glu Ser Lys Ser Met Leu Ile Cys Asn Asp Gly
195 200 205

Ile Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg
210 215 220

Ser Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gln Val Ala
225 230 235 240

Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys
245 250 255

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Thr Asp
260 265 270

Leu Lys Glu Arg Asn Ser Arg Ile Pro Thr Phe Leu Tyr Ala Met Pro
275 280 285

Phe Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg
290 295 300

Pro Gly Leu Arg Ile Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu
305 310 315 320

Asn His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys
325 330 335

Leu Ile Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gln Arg Val Val
340 345 350

Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met
355 360 365

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile
370 375 380

Gln Tyr Leu Gly Ser Glu Arg Ser His Ser Gly Asn Glu Leu Ser Thr
385 390 395 400

Ala Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu
405 410 415

Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala
420 425 430

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp
435 440 445

His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Ile Val Phe
450 455 460

Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Phe Glu Ile
465 470 475 480

Met Thr Lys Gly Thr Val Pro Leu Val Asn Met Ile Asn Asn Leu Leu
485 490 495

Gln Asp Lys Glu
500 



<210> 59 
<211> 500 
<212> PRT 
<213> Tobacco 

<400> 59 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Glu Phe Leu His Pro 
1 5 10 15

Val His Gly Phe Ser Val Lys Ala Ser Ser Phe Asn Ser Val Lys Pro
20 25 30

His Lys Phe Gly Ser Arg Lys Ile Cys Glu Asn Trp Gly Lys Gly Val
35 40 45 

Cys Val Lys Ala Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr 
50 55 60 

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys 
65 70 75 80 

Gly Leu Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu 
85 90 95 

Ala Val Ala Gln Gln Val Ser Glu Ala Gly Leu Ser Val Val Ser Ile
100 105 110

Asp Pro Ser Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly Val Trp Val
115 120 125

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp
130 135 140

Ser Gly Thr Val Val Tyr Ile Asp Asp Asn Thr Thr Lys Asp Leu Asp
145 150 155 160

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gln Leu Lys Ser Lys Met Met
165 170 175

Gln Lys Cys Ile Leu Asn Gly Val Lys Phe His His Ala Lys Val Ile
180 185 190

Lys Val Ile His Glu Glu Ala Lys Ser Met Leu Ile Cys Asn Asp Gly
195 200 205

Val Thr Ile Gln Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg
210 215 220

Cys Leu Val Gln Tyr Asp Lys Pro Tyr Lys Pro Gly Tyr Gln Val Ala
225 230 235 240

Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp Thr Ser Lys
245 250 255

Met Val Leu Met Asp Trp Arg Asp Ser His Leu Gly Asn Asn Met Glu
260 265 270

Leu Lys Glu Arg Asn Arg Lys Val Pro Thr Phe Leu Tyr Ala Met Pro
275 280 285

Phe Ser Ser Asn Lys Ile Phe Leu Glu Glu Thr Ser Leu Val Ala Arg
290 295 300

Pro Gly Leu Arg Met Asp Asp Ile Gln Glu Arg Met Val Ala Arg Leu
305 310 315 320

Asn His Leu Gly Ile Lys Val Lys Ser Ile Glu Glu Asp Glu His Cys
325 330 335

Val Ile Pro Met Gly Gly Ser Leu Pro Val Ile Pro Gln Arg Val Val
340 345 350

Gly Thr Gly Gly Thr Ala Gly Leu Val His Pro Ser Thr Gly Tyr Met
355 360 365

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala Ile Ile
370 375 380

His Tyr Leu Gly Ser Glu Lys Asp Leu Leu Gly Asn Glu Leu Ser Ala
385 390 395 400

Ala Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu
405 410 415

Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp Leu Pro Ala
420 425 430

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp
435 440 445

His Gly Phe Leu Ser Ser Arg Leu Tyr Leu Pro Glu Leu Ile Phe Phe
450 455 460

Gly Leu Ser Leu Phe Ser Arg Ala Ser Asn Thr Ser Arg Ile Glu Ile
465 470 475 480

Met Thr Lys Gly Thr Leu Pro Leu Val Asn Met Ile Asn Asn Leu Leu
485 490 495 

Gln Asp Thr Glu
500



<210> 60
<211> 511
<212> PRT

<213> Tagetes erecta
<400> 60

Met Asp Thr Phe Leu Arg Thr Tyr Asn Ser Phe Glu Phe Val His Pro
1 5 10 15

Ser Asn Lys Phe Ala Gly Asn Leu Asn Asn Leu Asn Gln Leu Asn Gln
20 25 30

Ser Lys Ser Gln Phe Gln Asp Phe Arg Phe Gly Pro Lys Lys Ser Gln
35 40 45

Phe Lys Leu Gly Gln Lys Tyr Cys Val Lys Ala Ser Ser Ser Ala Leu
50 55 60

Leu Glu Leu Val Pro Glu Ile Lys Lys Glu Asn Leu Asp Phe Asp Leu
65 70 75 80

Pro Met Tyr Asp Pro Ser Arg Asn Val Val Val Asp Leu Val Val Val
85 90 95

Gly Gly Gly Pro Ser Gly Leu Ala Val Ala Gln Gln Val Ser Glu Ala
100 105 110

Gly Leu Thr Val Cys Ser Ile Asp Pro Ser Pro Lys Leu Ile Trp Pro
115 120 125

Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu
130 135 140

Asp Cys Leu Asp Thr Thr Trp Ser Ser Ala Val Val Tyr Ile Asp Glu
145 150 155 160

Lys Ser Thr Lys Ser Leu Asn Arg Pro Tyr Ala Arg Val Asn Arg Lys
165 170 175

Gln Leu Lys Thr Lys Met Leu Gln Lys Cys Ile Ala Asn Gly Val Lys
180 185 190

Phe His Gln Ala Lys Val Ile Lys Val Ile His Glu Glu Leu Lys Ser
195 200 205

Leu Leu Ile Cys Asn Asp Gly Val Thr Ile Gln Ala Thr Leu Val Leu
210 215 220

Asp Ala Thr Gly Phe Ser Arg Ser Leu Val Gln Tyr Asp Lys Pro Tyr
225 230 235 240

Asn Pro Gly Tyr Gln Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu
245 250 255

His Pro Phe Asp Val Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser
260 265 270

His Leu Asp Gln Asn Leu Glu Ile Lys Ala Arg Asn Ser Arg Ile Pro
275 280 285

Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser Thr Arg Ile Phe Leu Glu
290 295 300

Glu Thr Ser Leu Val Ala Arg Pro Gly Leu Lys Met Glu Asp Ile Gln
305 310 315 320

Glu Arg Met Ala Tyr Arg Leu Lys His Leu Gly Ile Lys Val Lys Ser
325 330 335

Ile Glu Glu Asp Glu Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro
340 345 350

Val Leu Pro Gln Arg Val Leu Gly Ile Gly Gly Thr Ala Gly Met Val
355 360 365

His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro
370 375 380

Ile Val Ala Lys Ser Ile Ile Arg Tyr Leu Asn Asn Glu Lys Ser Met
385 390 395 400

Val Ala Asp Val Thr Gly Asp Asp Leu Ala Ala Gly Ile Trp Arg Glu
405 410 415

Leu Trp Pro Ile Glu Arg Arg Arg Gln Arg Glu Phe Phe Cys Phe Gly
420 425 430

Met Asp Ile Leu Leu Lys Leu Asp Leu Glu Gly Thr Arg Arg Phe Phe
435 440 445

Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser
450 455 460

Ser Arg Leu Phe Leu Pro Glu Leu Val Thr Phe Gly Leu Ser Leu Phe
465 470 475 480

Gly His Ala Ser Asn Thr Cys Arg Val Glu Ile Met Ala Lys Gly Thr
485 490 495

Leu Pro Leu Ala Thr Met Ile Gly Asn Leu Val Arg Asp Arg Glu
500 505 510 



<210> 61 
<211> 503 
<212> PRT 
<213> Daffodil 

<400> 61 
Met Asp Thr Leu Leu Arg Thr His Asn Arg Leu Glu Leu Leu Tyr Pro
1 5 10 15

Leu His Glu Leu Ala Lys Arg His Phe Leu Ser Pro Ser Pro Asn Pro
20 25 30

Gln Asn Pro Asn Phe Lys Phe Phe Ser Arg Lys Pro Tyr Gln Lys Lys
35 40 45

Cys Arg Asn Gly Tyr Ile Gly Val Ser Ser Asn Gln Leu Leu Asp Leu
50 55 60

Val Pro Glu Ile Lys Lys Glu His Leu Glu Phe Asp Leu Pro Leu Tyr
65 70 75 80

Asp Pro Ser Lys Ala Leu Thr Leu Asp Leu Ala Val Val Gly Gly Gly
85 90 95

Pro Leu Ala Arg Ser Cys Ser Thr Ser Leu Gly Gly Gly Leu Ser Val
100 105 110

Val Ser Ile Asp Pro Asn Pro Lys Leu Ile Trp Pro Asn Asn Tyr Gly
115 120 125

Val Trp Val Asp Glu Phe Glu Asp Met Asp Leu Leu Asp Cys Leu Asp 
130 135 140 

Ala Thr Trp Ser Gly Ala Ile Val Tyr Val Asp Asp Arg Ser Thr Lys
145 150 155 160

Asn Leu Ser Arg Pro Tyr Ala Arg Val Asn Arg Lys Asn Leu Lys Ser
165 170 175

Lys Met Met Lys Lys Cys Val Ser Asn Gly Val Arg Phe His Gln Ala
180 185 190

Thr Val Val Lys Ala Met His Glu Glu Glu Lys Ser Tyr Leu Ile Cys
195 200 205

Ser Asp Gly Val Thr Ile Asp Ala Arg Val Val Leu Asp Ala Thr Gly
210 215 220

Phe Ser Arg Cys Leu Val Gln Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr
225 230 235 240

Gln Val Ala Tyr Gly Ile Leu Ala Glu Val Glu Glu His Pro Phe Asp
245 250 255

Val Asp Lys Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Gly
260 265 270

Lys Ala Glu Leu Asn Glu Arg Asn Ala Lys Ile Pro Thr Phe Leu Tyr
275 280 285

Ala Met Pro Phe Ser Ser Asn Arg Ile Phe Leu Glu Glu Thr Ser Leu
290 295 300

Val Ala Arg Pro Gly Leu Lys Met Glu Asp Ile Gln Glu Arg Met Val
305 310 315 320

Ala Arg Leu Asn His Leu Gly Ile Arg Ile Lys Ser Ile Glu Glu Asp
325 330 335

Glu Arg Cys Val Ile Pro Met Gly Gly Pro Leu Pro Val Ile Pro Gln
340 345 350

Arg Val Val Gly Ile Gly Gly Thr Ala Gly Met Val His Pro Ser Thr
355 360 365

Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Ile Val Ala Asn
370 375 380

Ser Ile Val Gln Tyr Leu Val Ser Asp Ser Gly Leu Ser Gly Asn Asp
385 390 395 400

Leu Ser Ala Asp Val Trp Lys Asp Leu Trp Pro Ile Glu Arg Arg Arg
405 410 415

Gln Arg Glu Phe Phe Cys Phe Gly Met Asp Ile Leu Leu Lys Leu Asp
420 425 430

Leu Glu Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro
435 440 445

Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu
450 455 460

Val Pro Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Cys Lys
465 470 475 480

Leu Glu Ile Met Ala Lys Gly Thr Leu Pro Leu Val Asn Met Ile Asn
485 490 495

Asn Leu Val Gln Asp Arg Asp
500